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Polypeptides blologlquement actlfs Inseres dans une albumlne 



La pr6sente invention concerne de nouveaux polypeptides biologiquement 
5 actifs, leur preparation et des compositions pharmaceutiques les contenant. 

Quoique possSdant une ou plusieurs activit6s th6rapeutiques potentielles, de 
nombreux polypeptides ne peuvent, malheureusement, pas §tre explores 
pharmaceutiquement. Ceci peut avoir diff6rentes raisons, telles que notamment leur 
faible stability in vivo, leur structure complexe ou fragile, la difficult^ de les produire & 
10 une gchelle industriellement acceptable, etc... De mSme, certains polypeptides ne 
donnent pas les r6sultats attendus in vivo en raison de probldmes d'ad ministration, de 
conditionnement, de pharmacocin&ique etc... 

La pr6sente invention a pr6cis6ment pour objet de rem6dier & ces 
inconv6nients. 

15 Elle vise notamment l'6laboration de prolines artificielles biologiquement 

actives permettant une exploitation optimale, sur le plan th6rapeutique, des propri6t§s 
biologiques de ces polypeptides. 

La Demanderesse a ainsi mis en Evidence qu'it est possible d'ins6rer par voie 
gen§tique toute structure active, d§riv6e d'un polypeptide biologiquement actif, dans 
20 une autre structure prot&que constitute d'albumine, sans en alt§rer lesdites 
propri6t§s biologiques. De mani&re inattendue, la serum-albumine humaine permet de 
presenter efficacement la structure active & ses sites d'interaction et d'assurer une 
stability plasmatique 6lev6e au polypeptide recombinant de I'invention. 

* 

Plus pr6cis6ment, la pr§sente invention conceme un polypeptide 
25 recombinant comportant au moins une partie active d6riv6e d'un polypeptide, naturel 
ou artificiel, biologiquement actif, g6n§tiquement ins<§n§e dans une albumine ou un de 
ses variants ou d6riv6s. 

Par variant de I'albumine, on entend designer selon la pr6sente invention 
toute proline & haute demie-vie plasmatique obtenue par modification, & 1'aide des 
30 techniques du g6nie g6n6tique, d'un gdne codant pour un isomorphe donn6 de la 
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s§rum-albumine humaine, ainsi que touts macromol§cule & haute demie-vie 
plasmatique obtenue par modification in vitro de la prot§ine cod§e par de tels g&nes. 
(Par modification, on doit entendre toute mutation, substitution, d6l6tion, addition ou 
modification de nature g§n6tique et/ou chimique). L'albumine 6tant trds polymorphe, 
5 de nombreux variants naturels out §t6 identifi6s et r§pertori6s [Weitkamp L.R. et al., 
Ann. Hum. Genet. 2Z (1 973) 219]. 

En ce qui conceme les aerivSs d'albumine, il s'agit plus particulidrement de 
mol6cu!es comportant tout ou partie de l'albumine, fusionn§e le cas 6ch6ant k au 
moins une sequence polypeptidique provenant d'un g§ne nature! ou artificiel, elle 
10 mSme dotee ou non d'une activity 'ciologique. 

Dans la suite de la description, les diff6rents types d'albumines explicit6s ci- 
dessus sont commun6ment d§sign6s sous le terme albumine. 

Au sens de la prdsente invention, il est entendu par parte active, une parte 
poss§dant une activit6 qui peut sort §tre directe (traitement des maladies, diagnostic, 
15 recherche biologique, capteurs...}, ou indirecte (par exemple utilisable dans la 
pr6vention des maladies, dans la conception des vaccins, dans les techniques de 
I'imagerie mddicale etc.). 

Les parties actives de polypeptides biologiquement actifs, ins6r6es selon 
I'invention, pr6sentent de pr6f6rence un int6r§t th6rapeutique. 

20 Les polypeptides poss6dant une activit6 th6rapeutique peuvent 3tre d'origine 

humaine ou non. 

A titre repr6sentatif des polypeptides d'origine non humaine, on peut citer des 
peptides ou leurs d6riv6s, poss&dant des propri6t6s potentiellement utiles dans les 
pathologies des compartments sanguins et interstitiels, tels que I'hirudine, la 
25 trigramine, I'antistatine, les peptides anticoagulant des tiques (TAP), I'aridtine, 
I'applagine etc.... 

Selon un mode privil§gi6 de I'invention , le polypeptide ayant une activity 
th6rapeutique est un polypeptide d'origine humaine ou un variant moteculaire. Par 
exemple, il peut s'agir de tout ou partie, d'un enzyme, d'un inhibiteur tfenzyme, d'un 
30 antig&ne, d'un anticorps, d'une hormone, d'un r6cepteur, d'un facteur intervenant 
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dans le contrdle de la coagulation, d'un interferon, d'une cytokine [les interleukines, 
mais aussi leurs variants antagonistes naturels de leur fixation au(x) r6cepteur(s), les 
cytokines de type SIS (small induced secreted) et par exemple les protSines 
inflammatoires des macrophages (les MIPs), etc.], d'un facteur de croissance et/ou 

5 de diff6renciation [et par exeiv;pie les facteurs de croissance transformants (les 
TGFs), les facteurs de diff6rencis.tion des cellules sanguines (6rythropoi6tine, M-CSF, 
G-CSF, GM-CSF etc.). I'insuline et les facteurs de croissance qui lui ressemblent (les 
IGFs), ou encore les facteurs de perm6abilit6 cellulaire (VPF/VEGF), etc.], d'un 
facteur impliqu6 dans la g6n6sn/r6sorption des tissus osseux (OIF et ost6ospontine 

10 par exemple), d'un facteur impliqu6 dans la motilite ou la migration cellulaire [et par 
exemple le facteur de mobility autocrine (AMF), le facteur de stimulation de la 
migration (MSF), ou encore facteur de dispersion (scatter factor/facteur de 
croissance des h6patocytes)l d'un facteur bactericide ou antifongique, d'un facteur 
chimiotactique [et par exemple le facteur plaquettaire 4 (PF4), ou encore les peptides 

15 chemoattractants des monocytes (MCP/MCAF) ou des neutrophils (NCAF), etc...], 
d'un facteur cytostatique (et par exemple les proteines qui se fixent aux galactosides), 
d'une molecule adhesive plasmatique (et par example le facteur de von Willebrand, le 
fibrinog&ne etc..) ou interstitielle (laminine, tfinascine, vitronectine, etc..) ou des 
matrices extracellulaires, ou encore toute sequence peptidique antagoniste ou 

20 agoniste d'interactions moleculaires et/ou intercellulaires impliqu6es dans les 
pathologies des compartments circulatoires et interstitiels et par exemple la formation 
des thrombus arteriels et veinoux, les metastases canc6reuses, l'angiog6n6se 
tumorale. le choc inflammatoire, les maladies autoimmunes, les pathologies osseuses 
et ost&o-articulaires etc.. 

25 Bien entendu, la partie active des polypeptides de I'invention peut etre 

constitute par le polypeptide entier biologiquement actif ou par une structure d6riv6e 
de celui-ci ou encore correspondre & une sequence peptidique non naturelle, isoI6e 
par exemple k parti r de banques peptidiques aieatoires. (Pour des raisons de 
simplification, ces diff6rentes possibilit6s seront couvertes dans ce qui suit sous la 

30 designation commune "partie active (fun peptide biologiquement actif.) Au sens de la 
prfisente invention, on entend par structure d6riv6e tout polypeptide obtenu par 
modification et conservant une activite biologique. Par modification, on doit entendre 
toute mutation, substitution, deletion, addition ou modification de nature g6n6tique 
et/ou chimique. De tels derives peuvent etre g6n6r6s dans des buts diff6rents, tels 
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que notamment celui d'augmenter I'affinite de la molecule pour ses sites de fixation, 
ceiui d'am6Iiorer ses niveaux de production, celui d'augmenter sa resistances aux 
proteases, celui d'augmenter son efficacite thdrapeutique ou encore de rdduire ses 
effets secondares, ou celui de lui conf6rer de nouvelles propri6t6s biologiques. A titre 
5 d'exemple, les polypeptides chimdres de I'invention possddent des propri6t6s 
pharmacocin6tiques et une activity biologique utilisable pour la prevention ou le 
traitement des maladies. 

* 

Des polypeptides de ('invention, particulidrement avantageux, sont ceux dans 

lesquels la partie active pr6sente : 
10 . . (a) la structure peptidique entidre ou, 

(b) un fragment de (a) ou une structure d6riv6e de (a) par modification 

structural (mutation, substitution addition et/ou deletion d'un ou plusieurs r6sidus) et 

poss6dant une activity th6rapeutique. 

Parmi les structures du type (b), on peut citer plus particuli&rement les 
15 molecules dans lesquelles certains sites de N- ou O-glycosylation ont 6t6 modifies ou 

supprim6s, les molecules dans lesquelles un ou plusieurs r6sidus ont 6t6 substitu6s, 

ou les molecules dans lesquelles tous les rgsidus cysteine ont 6t6 substitu6s. On peut 

citer egalement des motecules obtenues & partir de (a) par deletion de regions 

n'intervenant pas ou peu dans I'interaction avec les sites de liaison consid6r6s, ou 
20 exprimant une activite ind6sirable, et des molecules comportant par rapport & (a) des 

r6sidus suppiementaires, tels que par exemple une methionine N-terminale et/ou un 

signal de s6cr6tion et/ou un peptide de jonction. 

L'objet de la pr6sente invention est particulidrement avantageux pour les 

peptides actifs trop petits pour former un domaine structural et/ou ne poss6dant pas 
25 une bonne stability in vivo et/ou une bonne biodisponibilite. L'insertion propos6e selon 

I'invention, permet de les associer & un ou des domaine(s) pr6-existant(s) de 

I'albumine et de b6n6ficier ainsi de la biodisponibilite et de la stabilite in vivo de celle- 

ci. 

De maniere generate, la taille des parties actives ins6r6es dans I'albumine 
30 varie entre trois k vingt cinq tesidus d'acides amines. Toutefois, des sequences allant 
de 1 residu k 100 r6sidus peuvent egalement §tre utilisees. 
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[.'insertion d*une partie active de peptide dans la sequence peptidique de 
l'albumine est r6alis6e selon I'invention de manidre k satisfaire aux deux conditions 
suivantes: 

II doit Stre pr6serv§ k ladite partie active, ins6r6e au sein de l'albumine, une 
5 accessibility suffisante afin de lui conserver intacte son activity biologique. Par 
ailleurs, la structure de 1'albumine ne doit pas non plus subir une d§stabilisation trop 
importante qui serait bien entendu pr6]udiciable au polypeptide recombinant dit la 
chim&re. 

Les sites d'insertion sont pr6f6rentiellement s6lectionn§s au sein de 

10 l'albumine en respectant les pr6c6dents imp6ratifs. 

Selon la structure cristalline publi6e par He et Carter (Nature 1992, 358, 209- 
215) l'albumine est form6e de ia r6p6tition de 3 domaines comprenant chacun deux 
sous-domaines et elle contient plus de 67% d'h&ices alpha. Chacun des domaines 
est superposable aux autres et est form§ de 10 helices notees de hi k h10. Le sous- 

15 domaine A comporte les helices hi k h6 et le sous-domaine B, les helices h7 k h10. 
Chaque sous domaine est form£ par un motif commun: hi, h2, h3, h4 pour le 
domaine A et h7, h8, h9 et h10 pour le domaine B. Les petites helices h5 et h6 
supplfimentaires sont Ii6es par un pont disulfure au sous domaine A. La figure 1 rend 
compte de manidre sch6matique de la structure de l'albumine humaine. 

20 Les sites d'insertion sont de preference localises dans les regions de 

l'albumine pr6sum6es former des regions expos6es k la surface de la molecule, ces 
regions 6tant pr6f6rentiellement des boucles. 

A titre de sites d'insertion convenant tout particulidrement k I'invention, on 
peut mentionner trots regions du premier domaine : 

25 - la region 5 qui s'dtend du r6sidu 57 k 62 et qui correspond k une boucle reliant les 
deux hdlices h3 et h4 ; 

- la region 8 comprenant les r6sidus 103 k 120 et correspondent k la zone inter sous- 
domaines. 

- la region 13 comprise entre les rdsidus 178 et 200 et qui correspond k une h6lice. 
30 Les helices h2 et h3 du domaine III d6limitent 6galement, du r6sidu 419 au 

r6sidu 430, une autre region d'insertion envisageable selon I'invention. 

Une partie active de peptide biologiquement actif peut 6tre ins6r§e selon 
trois modes diff§rents dans la sequence peptidique de l'albumine. 
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- il peut s'agir d'une insertion stride consistant en une simple addition de la s6quence 
du peptide d'int§r6t dans la sequence d'origine de I'albumine qui est conserve dans 
sa totality. 

- ^insertion peut corresponds k une substitution d'une portion de la s6quence 
5 . peptidique de I'albumine par la sequence peptidique correspondant k la partie active 

du peptide d'intdrdt. 

* 

- Enfin, il peut s'agir d'une insertion combinant une addition d'une partie de la 
sequence peptidique active et une substitution d'une portion de la sequence 
peptidique de I'albumine par le reste de la partie active du peptide actif. 

10 La figure 2 rend compte d'une manure schSmatique de ces differents modes 

d'insertion. 

Bien entendu, la partie active d'un peptide biologiquement actif peut §tre 
r6p6t§e plusieurs fois dans la chimdre au m§me endroit et/ou dans des regions 
diff§rentes de I'albumine. De m§me f il est 6galement possible d'ins6rer selon 
15 I'invention des parties actives d'rfferentes, sort issues d'un m§me peptide ou de 
peptides differents. 

Enfin, une partie active des peptides selon I'invention peut §tre ins6r6e soit 
strictement au sein de I'albumine soit entour6e de sequences de jo notion. 

* 

En ce qui concerne ces sequences de jonction, il peut notamment s'agir de 
20 sequences peptidiques riches en rdsidus glycine et/ou en r§sidus sdrine et/ou en 
r6sidus threonine et/ou tout rSsidu d'acide amin6 d6crit comme fr6quemment 
rencontr6 dans les zones de flexibility dans les prolines. 



Les polypeptides recombinants de I'invention s'avdrent tout particulidrement 
25 avantageux. 

lis permettent de maintenir dans I'organisme une activity biologique donnfee 
pendant un temps prolong^. II s'avdre ainsi possible de rdduire les doses 
administr6es et, dans certains cas, de potentialiser I'effet th6rapeutique, par example 
en r6duisant les effets secondaires consdcutifs k une administration plus importante. 
30 lis permettent avantageusement de g6n6rer et cfutiliser des structures d6riv§es de 
polypeptides biologiquement actifs tr6s petites et done tr6s sp6cifiques d'un effet 
recherchd. Ces polypeptides recombinants poss6dent par ailleurs une repartition 
particuli6rement avantageuse dans i 1 org an is me, ce qui modifie leurs propri6t6s 
pharmacocin6tiques et favorise le d6veloppement de leur activity biologique et leur 
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utilisation, lis pr6sentent 6galement I'avantage d'etre faiblement ou non- 
immunog6niques pour I'organisme dans lequel ils sont utilises. Finalement, les 
polypeptides de I' invention peuvent §tre exprimds (et pr6f6rentiellement s§cr6t6s) par 
des organismes recombinants, k des niveaux permettant leur exploitation industrielle. 

* 

5 Un autre objet de flnvention concerne un proc6d6 de preparation des 

molecules chimdres d6crites ci-avant. Plus pr6cis6ment, ce proc§d6 consiste k faire 
exprimer par un hdte cellulaire eucaryote ou procaryote une sequence nuclSotidique 
codant pour une partie active d'un polypeptide d6sir6, puis k r6colter le polypeptide 
produit. 

10 Parmi les hdtes eucaryotes utilisables dans le cadre de la pr6sente invention, 

on peut citer les cellules animales, les levures, ou les champignons. En particulier, 
s'agissant de levures, on peut citer les levures du genre Saccharomyces. 

Kluyveromvces . Pichia . Rnhwanniomvces. ou Hansenula. S'agissant de cellules 
animates, on peut citer les cellules COS, CHO, CI27, etc... Parmi les champignons 
15 susceptibles d'etre utilises dans la pr6sente invention, on peut citer plus 
particulferement Aspergillus ssp. ou Trichoderma ssp. Comme hdtes procaryotes, on 
pr6fere utiliser les bactfiries telles que Escherichia coli . ou appartenant aux genres 

Corvnebactarium. BacillUS. OU StrfttrtQmVCeS- 

Les sequences nucl6otidiques utilisables dans le cadre de la pr6sente 
20 invention peuvent §tre pr6par6es de diff6rentes manures. G6n6ralement, elles sont 
obtenues en assemblant en phase de lecture les sequences codant pour chacune 
des parties fonctionnelles du polypeptide. Celles-ci peuvent §tre isol§es par les 
techniques de I' horn me de Tart, et par exemple directement k parti r des ARN 
messsagers (ARNm) cellulaires, ou par reclonage k parti r d'une banque d'ADN 
25 compl6mentaire (ADNc), ou encore il peut s'agir de sequences nucl6otidiques 
totalement synth6tiques. II est entendu de plus que les sequences nucteotidiques 
peuvent §galement 6tre ult6rieurement modifi6es, par exemple par les techniques du 
g6nie g6n6tique, pour obtenir des d6riv6s ou des variants desdites sequences. 

Uinsertion de cette sequence nucteotidique, codant pour la partie active du 
30 polypeptide, peut §tre r§alis6e directement ou non, selon la region choisie pour site 
d'insertion, dans le gdne codant pour I'albumine. 

La region s6lectionn6e peut, en effet, ne pas comporter de site de restriction 
adgquat k la realisation de ladite insertion. Dans cette hypoth$se, il peut s'avdrer 
int£ressant, prfealablement k I'insertion, d'introduire un ou plusieurs sites de restriction 
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* 



uniques. La crfeation de sites de restriction, au niveau de la r6gion choisie pour site 
d'insertion, se fait de pr6f§rence par mutag6ndse dirig§e selon des techniques 
classiques. Toutefois on peut 6galement ins6rer la sSquence nucteotidique 
correspondant & "la partie active du polypeptide biologiquement actif directement par 
5 mutagSn6se dirig6e sans que I'irisertion ne fasse apparaitre de sites de restriction 
particuliers. 

Dans le cas particulier de la region 5 du g&ne codant pour t'albumine, la 
pr6sence du site de restriction PVull autorise finsertion directe de la s6quence 
nucteotidique. II se r6v6le done particulterement utile comme site de clonage d'une 

10 partie active d'un polypeptide que Ton souhaite ins6rer en phase traductionnelle dans 
la sequence de I'albumine au niveau du 57 6me rfesidu. 

Dans ce mode d'insertion, mettant & profit un site de restriction unique d6j& 
existant dans la sfiquence d'origine de I'albumine, la ligature de la sequence codant 
pour le peptide actif avec le fragment de restriction, correspondant & la totalit6 du 

15 g&ne codant pour I'albumine, g6n6re une s6quence nucteotidique comportant un 
g&ne hybride codant pour une proline chim^re du type SAH insertion stricte. 

Dans le cas particulier de la zone 41 9 & 430, la pr6sence de 2 sites uniques 
d'insertion, (Hindi et Arvll) permet d'ins6rer et/ou substituer les peptides d'int6rdt 
bioiogique & la sequence de I'albumine. 

20 En ce qui concerne plus particulterement les r6gions 8 et 13, I'insertion de 

ladite s6quence est favoris6e si I'on y cr6e pr6alablement un ou des sites de 
restriction manipulates. Bien entendu, les sites de restriction & cr6er au niveau de la 
s§quence sont choisis en tenant compte de la nature des sites de restriction d6j£ 
existants. lis doivent conduire & une insertion s6lective. 

25 Dans ce second mode de r6alisation, I'emploi de deux sites de restriction 

uniques permet de construire des g§nes codant pour des chimdres ayant le peptide 
actif en insertion et/ou en substitution. L'insertion du peptide se fait par rernplacement 
d'un fragment bom6 de deux sites de restriction uniques, dans I'ADN comptementaire 
de I'albumine, ins6r6s par mutag6ndse dirigSe. Ces deux sites de restriction uniques 

30 pourront 6tre, Mst I et Kpn I, dans la r§gion 8 et Sst I et Xho I dans la r6gion 13. La 
creation de ces sites de restriction peut ou non modifier la sequence polypeptidique 
de I'albumine humaine. Le clonage subs§quent du peptide actif en phase codante 
dans ce gdne de I'albumine peut 6tre exclusivement la sequence codante du peptide 
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ou peut correspondre & un panachage de la sequence codante du peptide et de la 
sequence codante du fragment d6I6t6 de Palbumine. 

La pr6sente invention vise 6galement la protection des variants de 
5 s6quences nucieotidiques codant pour I'albumine correspondante c'est £ dire 
integrant au moins un site de restriction unique non naturel c'est & dire non present 

dans la sequence d'origine. 

Avantageusement, la creation de ces sites de restriction peut servir par la 
suite & I'insertion d'un ou plusieurs parties actives de polypeptide(s) biologiquement 
10 actif(s) dans la proteine mature. 

Plus preferentiellement, dans le proc6d6 de I'invention, la sequence 
nucl6otidique fait partie d'une cassette d'expression comprenant une region d'initiation 
de la transcription (region promoteur) permettant, dans les cellules hates, I'expression 

15 de la sequence nuc!6otidique plac6e sous son contrdle et codant pour les 
polypeptides de I'invention. Cette region peut provenir de regions promoteurs de 
genes fortement exprimes dans la cellule hdte utilis6e, I'expression 6tant constitutive 
ou regulable. S'agissant de levures, il peut s'agir du promoteur du g6ne de la 
phosphoglyc6rate kinase (PGK) . de la glyc6raid6hyde-3-phosphate d6shydrog6nase 

20 (GPP) , de la lactase (LAC4 ). des 6nolases (ENQ), des alcools deshydrog6nases 
(ADHV etc... S'agissant de bact6ries, il peut s'agir du promoteur des gdnes droit ou 
gauche du bacteriophage lambda (Pl. Pr). ou encore des promoteurs des g6nes des 
op6rons tryptophane (Ptrp) ou lactose (P|ac)- En outre, cette r6gion de contrdle peut 
6tre modifi6e, par exemple par mutag6nese in vitro, par introduction d'6I6ments 

25 additionnels de contrdle ou de sequences synthetiques, ou par des deletions ou des 
substitutions des elements originels de contrflle. La cassette tfexpression peut 
6galement comprendre une region de terminaison de la transcription fonctionnelle 
dans I'hdte envisage, positionn6e immediatement en aval de la sequence 
nucI6otidique codant pour un polypeptide de 1'invention. 

30 Dans un mode pr6f6r6, les polypeptides de I'invention r6sultent de 

I'expression dans un hdte eucaryote ou procaryote d'une sequence nuc!6otidique et 
de la s6cr6tion du produit d'expression de ladite sequence dans le milieu de culture. II 
est en effet particulierement avantageux de pouvoir obtenir par voie recombinante 
des molecules directement dans le milieu de culture. Dans ce cas, la sequence 
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nucieotidique codant pour un polypeptide de I'invention est pr6c6d6e rfune sequence 
"leader - (ou sequence signal) dirigeant le polypeptide naissant dans les voies de 
s6cr6tion de I'h5te utilise. Cette sequence "leader" peut fetre la sequence signal 
naturelle du polypeptide biologiquement actif dans le cas oD celui-ci est une proline 
5 naturellement s6cr6t6e t ou celle de la structure stabilisatrice, mais il peut 6galement 
s'agir de toute autre sequence "leader" fonctionnelle, ou d'une sequence "leader" 
artificielle. Le choix de Tune ou I'autre de ces sequences est notamment guide par 
I'h6te utilise. Des exemples de sequences signal fonctionnelles incluent celles des 
genes des ph6romones sexuelles ou des toxines "killer" de levures. 

10 En plus de la cassette depression, un ou plusieurs marqueurs permettant 

de s6lectionner I'hdte recombin6 peuvent etre additionn6s, tels que par exemple le 



URA : 



antibiotiques comme la gen6ticine (G418) ou & tout autre compose toxique comme 
certains ions m6talliques. 

15 L'ensemble constitu6 par la cassette depression et par le marqueur de 

selection peut 6tre introduit directement dans les cellules hdtes consid6r6es, soit 
ins6r6 prfialablement dans un vecteur autor6plicatif fonctionnel. Dans le premier cas, 
des sequences homologues & des regions pr6sentes dans le g6ndme des cellules 
hdtes sont pr6f6rentiellement additionn6es & cet ensemble; lesdites sequences 6tant 

20 alors positionnees de chaque cdte de la cassette, d'expression et du gene de 
selection de fagon & augmenter la frequence d'integration de l'ensemble dans le 
g6ndme de I'hdte en ciblant Integration des sequences par recombinaison 
homologue. Dans le cas ou la cassette d'expression est ins6r6e dans un systeme 
r6plicatif, un systeme de replication pr6f6r6 pour les levures du genre Kluyveromvces 

25 est derive du plasmide pKD1 initialement isol6 de K.drosophilarum: un systeme 
pr6f6r6 de replication pour les levures du genre Saccharomvces est d6riv6 du 
plasmide Z\x de S. cerevisiae . De plus, ce plasmide d'expression peut contenir tout ou 
partie desdits systemes de replication, ou peut combiner des elements d6riv6s du 
plasmide pKD1 aussi bien que du plasmide 2\x. 

30 En outre, les plasmides d'expression peuvent etre des vecteurs navettes 

entre un hflte bacterien tel que Escherichia coli et la cellule hdte choisie. Dans ce cas, 
une origine de replication et un marqueur de selection fonctionnant dans I'hdte 
bacterien sont requises. II est 6galement possible de positionner des sites de 
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restriction entourant les sequences bacteriennes et uniques sur le vecteur 
d'expression: ceci permet de supprimer ces sequences par coupure et religature in 
vitro du vecteur tronqu6 avant transformation des cellules hdtes, ce qui peut r6sulter 
en une augmentation du nombre de copies et en une stability accrue des plasmides 

5 d'expression dans lesdits hdtes. Par exemple, de tels sites de restriction peuvent 
corresponds aux sequences telles que 5'-GGCCNNNNNGGCC-3' (Sfil) (SEQ ID N°1) 
ou ff-GCGGCCGC-S' (Noll) (SEQ ID N°2) dans ia mesure oO ces sites sont 
extrSmement rares et g6n6ralement absents d'un vecteur d'expression. 

AprSs construction de tels vecteurs ou cassette d'expression, ceux-ci sont 

10 introduits dans les cellules hdtes retenues selon les techniques classiques dfecrhes 
dans la Iitt6rature. A cet 6gard, toute mfethode permettant d'introduire un ADN 
Stranger dans une cellule peut fitre utilis6e. II peut s'agir notamment de 
transformation, eiectroporation, conjugaison, ou toute autre technique connue de 
Thomme de Tart. A titre d'exemple pour les hdtes de type levure, les diff6rentes 

15 souches de Klnyveromvces utilis6es ont 6t6 transform6es en traitant les cellules 
entires en presence d'ac6tate de lithium et de polyethylene glycol, selon la 
technique d&crite par Ito et al. [J- Bacteriol. 153 (1983) 163]. La technique de 
transformation d6crite par Durrens et al. [Curr. Genet. Ifi (1 990) 7] utilisant l'6thyldne 
glycol et le dimfithylsulfoxyde a 6galement 6t6 utilis6e. II est aussi possible de 

20 transformer les levures par eiectroporation, selon la m6thode d6crite par Karube et al. 
[FEBS Letters 1B2 (1985) 90]. Un protocole alternatif est 6galement dficrit en detail 
dans les exemples qui suivent. 

Aprds selection des cellules transformdes, les cellules exprimant lesdits 

* 

polypeptides sont inocuiees et la recuperation desdits polypeptides peut 6tre faite, 
25 soit au cours de la croissance cellulaire pour les proc6d6s "en continu", soit en fin de 
croissance pour les cultures "en lots" ("batch"). Les polypeptides qui font I'objet de la 
pr6sente invention sont ensuite purifies & partir du surnageant de culture en vue de 
leur caract6risation mol6culaire, pharmacocin6tique et biologique. 

Un systeme d'expression pr6f6r6 des polypeptides de linvention consiste en 
30 I'utilisation des levures du genre Ktuyveromyces comme cellule hdte, transform6es 
par certains vecteurs d6riv6s du replicon extrachromosomique pKD1 initialement iso!6 
chez *- mancianus var. rirnsnnhilamm. Ces levures, et en particulier K, lactis et JL 
fragilis sont g6n6ralement capables de r&pliquer lesdits vecteurs de fagon stable et 
poss6dent en outre Tavantage d'etre incluses dans la liste des organismes G.R.A.S. 
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('Generally Recognized As Safe*). Des levures privil6gi6es sont pr6f§rentiellement 
des souches industrielles du genre Kluweromvces capables de r£pliquer de fagon 
stable lesdits plasmides d§riv6s du plasmide pKD1 et dans lesqueis a 6t6 ins6r6 un 
marqueur de selection ainsi qu'une cassette d'expression permettant la s6cr6tion k 
5 des niveaux 6lev6s des polypeptides de I'invention. 

La pr§sente invention concerne 6galement les sequences nuclfiotidiques 
codant pour les polypeptides chim6res d6crits ci-avant, ainsi que les cellules 
recombinantes, eucaryotes ou procaryotes, comprenant de telles sequences. 

La pr6sente invention concemo aussi I'application & titre de medicament des 
10 polypeptides selon la pr6sente nvention. Plus particulidrement, I'invention a pour 
objet toute composition pharmacci.tique comprenant un ou plusieurs polypeptides ou 
sequences nucl6otidiques tels que d6crits ci-avant. Ces compositions 
pharmaceutiques peuvent se presenter sous forme de diverses formulations. II peut 
notamment s'agir de nanoparticules k la surface desquelles sont presents des 
15 polypeptides selon I'invention. Ce type de formulation est plus particulterement utilise 
pour effectuer des ciblages diriges en principe actif. Bien entedu, les sequences 
nucl6otidiques peuvent §tre utilisfees en th6rapie g6nique. 

La pr6sente invention sera plus complement d6crite & I'aide des exemples 
qui suivent, qui doivent Stre considers comme illustratifs et non limitatifs. 

20 

L1STE DES FIGURES 

Les representations des plasmides indiqu6es dans les Figures suivantes ne sont pas 
trag6es & l'6chelle et seuls les sites de restriction importants pour la comprehension 
des ctonages r6alis6s ont 6t6 indiqu6s. 

25 Figure 1: Representation sch6matique du domaine I de I'albumine avec localisation de 
sites d'insertion selon Invention. * signale la localisation du site d'insertion h2-h3 
dans le domaine III. Les chiffres 5, 8 et 13 identifient les zones d'insertion 
correspondantes. 

Figure 2 : Representation sch6matique de diff6rents modes d'insertion d'un peptide 
30 actif dans la structure de Falbumine. 
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Figure 3: Plasmide pYG105. 

Figure 4: Modification de la region 5 de I'albumine suite & I'insertion de la sequence 
codant pour IEGR telle que d6crite dans i'exemple 8.1. Les modifications figurent en 
caractdres gras. (.'emplacement des sites de restriction modifies ou apportes sont 
5 indiquds par un trait horizontal et la position de coupure des enzymes correspondant 
par un trait vertical. 

* 

Figure 5: Strategic de clonage pour ('insertion de IEQR dans la region 5 (exemple 
8.1). 

Figure 6: Strategic de clonage pour I'insertion d'un peptide actif dans la region 13 de 
10 Talbumine. 

EXEMELES 

TECHNIQUES GENERALES DE CLONAGE 

15 Les m&hodes classiquement utilis6es en biologie moI6culaire telles que les 

extractions preparatives cf ADN plasmidique, la centrifugation d'ADN plasmidique en 
gradient de chlorure de c6sium, l'6lectrophor6se sur gels d'agarose ou d'acrylamide, 
la purification de fragments cf ADN par 61ectro6lution, les extraction de prolines au 
phenol ou au ph6nol-chloroforme, la precipitation d'ADN en milieu salin par de 

20 l'6thanoI ou de I'isopropanol, la transformation dans Escherichia coli etc ... sont bien 
connues de I'homme de metier et sont abondament decrites dans la literature 
[Maniatis T. et al., "Molecular Cloning, a Laboratory Manual", Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., 1982 ; Ausubel F.M. et al. (eds), "Current 
Protocols in Molecular Biology", John Wiley & Sons, New York, 1 987]. 

25 Les enzymes de restriction ont ete fournies par New England Biolabs 

(Biolabs), Bethesda Research Laboratories (BRL) ou Amersham et sont utilis6es 
selon les recommandations des foumisseurs. 

Les plasmides de type pBR322, pUC et les phages de la s6rie M13 sont 
d'origine commerciale (Bethesda Research Laboratories). 

30 Pour les ligatures, les fragments d'ADN sont s6par6s selon leur taille par 

eiectrophor&se en gels d'agarose ou d'acrylamide, extraits au phenol ou par un 
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mfelange ph6nol/chloroforme, pr6cipit6s k l*6thanol puis incub6s en pr6sence de I'ADN 
ligase du phage T4 (Biolabs) selon les recommandations du fournisseur. 

Le remplissage des extremes 5' pro6minentes est effectu6 par le fragment de 
Klenow de I'ADN PoIymSrase I d 'E. coli (Biolabs) selon les sp6cifications du 
5 fournisseur. La destruction des extr6mit6s 3' pro6minentes est effectu6e en pr6sence 
de I'ADN Polymerase du phage T4 (Biolabs) utilis6e selon les recommandations du 
fabricant. La destruction des extr6mit6s 5' pro6minentes est effectufie par un 
traitement m6nag§ par la nuclfiase S1 . 

La mutag6n6se dirig6e in vitro par oligod6oxynuc!6otides synthfitiques est 
io effectuSe selon la m£thode d6velopp6e par Taylor et al. [Nucleic Acids Res. 13 
(1985) 8749-8764] en utilisant le kit distribuS par Amersham. Cette technique est 
notamment mise en oeuvre, dans le cadre de la pr6sente invention, pour cr6er des 
sites de restriction uniques en vu d'une insertion subs§quente. 

^amplification enzymatique de fragments d'ADN par la technique dite de PCR 
15 [Eolym6rase-catalyzed Chain Reaction, Saiki R.K. et al., Science 22Q (1985) 1350- 
1354; Mullis K.B. et Faloona F.A., Meth. Enzym. 155 (1987) 335-350] est effectu6e en 
utilisant un "DNA thermal cycler" (Perkin Elmer Cetus) selon les specifications du 
fabricant. 

La verification des sequences nucieotidiques est effective par la mfithode 
20 d6veiopp§e par Sanger et al. [Proc. Natl. Acad. Sci. USA, J£ (1 977) 5463-5467] en 
utilisant le kit distribufi par Amersham. 

Les transformations de K- |actis avec I'ADN des plasmides d'expression des 
prolines de la pr6sente invention sont effectuSes par toute technique connue de 
I'homme de Tart, et dont un exemple est donn6 dans le texte. 
25 Sauf indication contraire, les souches bact&iennes utilis6es sont E. coli 

MC1060 (laclPOZYA, X74, flalU. flalK. sttA r ), ou E. coli TG1 (lac, biqA.B, sucE, ltd, 

hsdD5 / F'iiaD36, xhqa + b + , iaci q . lacz, mi 5). 

Les souches de levures utilis6es appartiennent aux levures bourgeonnantes et 
plus particulferement aux levures du genre Kluyveromyces. Les souche K. lactis 
30 MW98-8C (a, liraA. atg. K + , pKD1°) et K. lactis CBS 293.91 ont 6t6 
particulifcrement utilis6es; un Schantillon de la souche MW98-8C a 6t6 d6pos§ le 1 6 
Septembre 1988 au Centraalbureau voor Schimmelkulturen (CBS) k Baarn (Pays 
Bas) ou il a 6t6 enregistr6 sous le num6ro CBS 579.88. 
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Une souche bact6rienne (E. coin transform§e avec le plasmide pET-8c52K a 
6t§ d6pos6e le 17 Avril 1990 auprds da r American Type Culture Collection sous le 

num6ro ATCC 68306. 

Les souches de levures transform6es par les plasmides d'expression codant 
5 pour les prolines de la pr6sente invention sont cultiv6es en erlenmeyers ou en 
fermenteurs pilotes de 21 (SETRIC, France) k 28°C en milieu riche (YPD: 1 % yeast 
extract, 2% Bactopeptone, 2% glucose; ou YPL: 1 % yeast extract, 2% Bactopeptone, 
2% lactose) sous agitation constante. 

10 

EXEMPLE 1: 

PROTOCOLE D'INSERTION STRICTE DE PEPTIDE EN UT1L1SANT UN SITE DE 
RESTRICTION UNIQUE, PRESENT DANS LE GENE DE LA SAH. 

15 De par son unicit§ dans le g&ne HSA et le vecteur associg, le site Exull 

localise naturellement dans la sequence codante, est particulterement utile comme 
site de clonage d'un peptide biologiquement actif que Ton ddsire insurer en phase 
traductionnelle dans la SAH au niveau du 58 i6me r6sidu. Dans un mode de 
realisation particulier, il est utile d'employer des peptides de p r6sidus dont la 

20 sequence codante est [3xN]p. Dans ce cas, les oligonucleotides synth§tis£s sont du 
type 5'-NN [3xN]pN-3' et son brin complSmentaire. La ligature de ce fragment avec le 
fragment de restriction Hindlll-Hindlll A Pvull, correspondent 4 la totality du g£ne 

codant pour la SAH, g6n£re un fragment de restriction Hindlll-Jtiindlll comportant un 
g£ne hybride codant pour une proline chimSre du type SAH insertion stricte. Dans 
25 un autre mode de realisation, le peptide peut dtre r6p6t6 plusieurs fois dans la 
chimdre. 

■ 

EXEMPLE 2: 

30 PROTOCOLE D'INSERTION TOTALE OU PARTI ELLE DE PEPTIDE EN UTILISANT 
DEUX SITES DE RESTRICTION UNIQUES ET NATURELS. 

Dans un mode de realisation particulier, ('utilisation de deux sites de restriction 
uniques permet de construire des genes codant pour des chimeres ayant le peptide 
actif en insertion stricte ou en substitution ou en insertion partielle. 
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Par exemple, I'existence des sites uniques tiiocll et AYlll dans la s6quence 
codante de SAH et dans le vecteur permet de g6n6rer un fragment Mndlll-MndHI A 
Hinc ll-Avrll. L'6limination du fragment nucteotidique £liQ£lI-Ay[H correspond k la 
dtl6tion du fragment peptidique T(420) - N(429). ^utilisation d'un oIigonuc!6otide 
5 compl6mentaire appropri6 permet de doner un peptide en phase codante dans le 
gSne de la SAH. Ce fragment de restriction peut §tre exclusivement la s6quence 
codante compl6mentaire du peptide ou peut correspondre k un panachage de la 
s6quence codante du peptide actif et de la sequence codante du fragment T(420) - 
N(429) de la SAH. Le peptide actif peut etre present plusieurs fois dans la chimSre. 

10 

EXEMPLE 3 

PLASMIDES D'EXPRESSION 

Les prot§ines chim6res des exemples pr§c§dents peuvent fitre exprimfies 
dans les levures k partir de promoteurs fonctionnels, r6gulables ou constitutifs, tels 

15 que, par exemple, ceux pr6sents dans les plasmides pYQ105 (promoteur LAC4 de 
Kluweromvces lactis) . pYG106 (promoteur PGK de Saccharomvces cereYlSiae), 
pYG536 (promoteur £tlQ5 de S.cerevisiae) . ou des promoteur hybrides tels que ceux 
dScrits dans la demande de brevet EP 361 991. Les plasmides pYG105 et pYG106 
sont ici particulterement utiles car ils permettent I'expression des g&nes cod§s par les 

20 fragments de restriction Hind lll tel que dficrits dans les exemples pr6c6dents et clon6s 
dans le site Hind lll et dans I'orientation productive (dSfinie comme I'orientation qui 
place la r6gion "pr6pro" de I'albumine de fagon proximate par rapport au promoteur de 
transcription), k partir de promoteurs fonctionnels chez KJaclia, r6gulables (pYG105) 
ou constitutifs (pYG106). Le plasmide pYG105 correspond au plasmide pKan707 

25 d6crit dans la demande de brevet EP 361 991 dans lequel le site de restriction tiindlll 
unique et localise dans le g&ne de resistance k la g6n6ticine (G41 8) a 6t6 dStruit par 
mutag6n6se dirig6e tout en conservant une proline inchangSe (oligodeoxynucleotide 
5 , -GAAATGCATAAGCTCTTGCCATTCTCACCG-3 , (SEQ ID N°3). Le fragment Sail- 
Sac l codant pour le g6ne URA 3 du plasmide mut6 a 6t6 ensuite remplacS par un 

30 fragment de restriction Salt- Sac l comportant une cassette d'expression constitute du 
promoteur LAC4 de K. lactis (sous la forme d'un fragment Sall-idifldlll) et du 
terminateur du g6ne PGK de S. cerevisiae (sous la forme d'un fragment hindM-SacD- 
Le plasmide pYG105 est mitotiquement trfes stable chez les levures Kluyveromvces. II 
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est represents en figure 3. Les plasmides pYG105 et pYG106 ne different entre eux 
que par la nature du promoteur de transcription encodd par le fragment Sai l- Hind i 
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EXEMPLE 4 : 

TRANSFORMATION DES LEVURES 

La transformation des levures appartenant au genre Kluyveromvces. et en 
particulier les souches MW98-8C et CBS 293.91 de KJactis. s'effectue par exemple 

5 par la technique de traitement des cellules entidres par de Tac6tate de lithium [Ito H. 
et al. t J. Bacteriol. 153 (1983) 163-168], adaptee comme suit. La croissance des 
cellules se fait k 28°C dans 50 ml de milieu YPD, avec agitation et jusqu'4 une densitS 
optique k 600 nm (DOqoo) comprise entre 0,6 et 0,8; les cellules sont r6colt6es par 
centrifugation k faible vitesse, Iav6es dans une solution sterile de TE (10 mM Tris HCI 

10 pH 7,4; 1 mM EDTA), resuspendues dans 3-4 ml d'acetate lithium (0,1 M dans du TP 
pour obtenir une densite cellulaire d'environ 2 x 10 8 cellules/ml, puis incubfees k 30°C 
pendant 1 heure sous agitation mod6r6e. Des aliquotes de 0,1 ml de la suspension 
r6sultante de cellules comp6tentes sont incubfes k 30°C pendant 1 heure en presence 
d'ADN et k une concentration finale de 35% de polyethylene glycol (PEG4000. 

15 Sigma). Aprfcs un choc thermique de 5 minutes k 42°C, les cellules sont Iav6es 2 fois, 
resuspendues dans 0,2 ml d'eau sterile et incub6es 1 6 heures k 28°C dans 2 ml de 
milieu YPD pour permettre Expression ph6notypique du gfene de resistance au G418 
exprim6 sous contraie du promoteur (cf. EP 361 991); 200 pi de la suspension 
cellulaire sont ensuite 6tal6s sur boites YPD s6Iectives (G418, 200 pg/ml). Les boites 

20 sont mises k incuber k 28°C et les transformants apparaissent aprds 2k3 jours de 
croissance cellulaire. 

EXEMPLE 5; 

25 SECRETION DES CHIMERES 

Aprfcs selection sur milieu riche suppl6ment6 en G41 8 les clones recombinants 
sont testes pour leur capacite k s6cr6ter la forme mature des proteines chim6res. 
Quelques clones correspondant k la souche CBS 293.91 ou MW98-8C transforms 
par les plasmides ^expression des chimeres entre la SAH et la partie biologiquement 
30 active sont mis k incuber en milieu YPD ou YPL k 28°C. Les surnageants cellulaires 
sont r6cup6r6s par centrifugation quand les cellules atteignent la phase stationnaire 
de croissance, 6ventuellement concentres 10 fois par precipitation pendant 30 
minutes k -20°C dans une concentration finale de 60% d'ethanol, puis testes aprds 
6Iectrophor6se en gel SDS-PAGE 0. soit directement par coloration du gel par du 
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bleu de coomassie, soit aprds immunobtot en utilisant des anticorps prlmaires dirig6s 
contre la partie biologiquement active ou un s6rum polyclonal de lapin dirig6 contre la 
SAH. Lors des experiences de detection immunologique, le filtre de nitrocellulose est 
d'abord incubd en presence des anticorps primaires sp6cifiques, Iav6 plusieurs fois, 

5 incube en presence d'anticorps de chfcvre dirig6s contre les anticorps primaires, puis 
incube en presence d f un complexe avidine-p6roxydase en utilisant le "kit ABC" 
distribu6 par Vectastain (Biosys S.A., Comptegne, France). La reaction 
immunologique est ensuite r6v6I6e par addition de diamino-3,3' benzidine 
tetrachlorydrate (Prolabo) en presence d'eau oxyg6n6e, selon les recommandations 

io du fabricant. 




15 



PURIFICATION DES CHIMERES 



Les chimdres pr6sentes dans les surnageants de culture correspondant & la 
souche CBS 293.91 transform^, sont caracteris6es dans un premier temps & I'aide 

* 

d'anticorps sp6cifiques de la partie SAH. II peut etre souhaitable de purifier certaines 
de ces chim&res. La culture est alors centrifug§e OOOOOg, 30 min), le surnageant est 

20 pass6 & travers un filtre de 0,22 mm (Millipore), puis concentre par ultrafiltration 
(Amicon) en utilisant une membrane dont le seuil de discrimination se situe & 30 kDa. 
Le concentrat obtenu est alors dialysd contre une solution de Tris HCI (50 mM pH 8) 
puis purifie sur colonne. Par exemple, le concentrat correspondant au surnageant de 
culture de la souche CBS 293.91 transform^ est purifie par chromatographic 

25 d'affinite sur Bleu-Trisacryl (IBF) et les 6chantillons sont ensuite dialysis contre de 
I'eau. Une purification par tamis moleculaire peut Stre ensuite r6alisee. Dans ce cas, 
une colonne Superose 1 2 (Pharmacia) est pr6alablement equilibree dans du tampon 
20mM NaH2P04 100mM NaCI pH 7.0 et les 6chantillons issus de la chromatographie 
d'affinite sont charges sur la colonne, r6colt6s et caracteris6s. Une purification par 

30 chromatographie d'6change d'ions peut 6galement etre utilis6e. Dans certains cas, le 
concentrat obtenu apr&s ultrafiltration est diaiyse contre une solution de Tris HCI (50 
mM pH 8), puis d6pose par fractions de 20 ml sur une colonne (5 ml) 6changeuse de 
cations (S Fast Flow, Pharmacia) equilibree dans le mSme tampon. La colonne est 
alors iav6e plusieurs fois par la solution de Tris HCI (50 mM pH 8) et la proteine 
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chimfcre est alors 6lu6e de la colonne par un gradient (0 4 1 M) de NaCI. Les 
fractions contenant la prot6ine chim6re sont alors r6unies et dialys6es contre une 
solution de Tris HCI 50 mM (pH 8) puis red6pos§es sur colonne S Fast Flow. Aprfcs 
6lution de la colonne, les fractions contenant la proline sont r6unies, dialys6es 
5 contre de I'eau et lyophilis§es avaht caract6risation. 



INSERTION DU PEPTIDE 1 1 DANS LA SEQUENCE DE L'ALBUMINE 

10 

Le peptide 11 a 6t§ d§crit en tant qu'6pitote de la tryptophane synthase 
(Larvor et al. Mol. Immunol. (1991) 28, 523-531). 

1. Insertion stricte dans une region ayant naturellement des sites de restriction 
15 uniques 

Le peptide 1 1 a la s6quence peptidique suivante dans le sens N a C terminal: 
HGRVGIYFGMK (SEQ ID N°20). La strat6gie tfinsertion dans la zone Miflfill-Mtl a 
consists & faire une ligature entre un fragment nucteotidique synth6tis6 codant pour le 
20 peptide 1 1 et tel que le cadre de lecture pour la s6quence codante de la SAH soit 
respect6 et que la nature de la sequence codante de la SAH soit celle d'origine. Dans 
un premier temps nous avons done synth6tis6 les deux oligonucl6otides suivants : 5'- 
CCATGGTAGAGTAGGTATCTATTTCGGTATGAAAACTCCAACTCTTGTAGAGGTCT 

CGAGAAAT-3' (SEQ ID N°4) et 5'- 

25 CTAGATTTCTCGAGACCTCTACMGATGTGGAGTTTTCATACCGAAATAGATACCT 

ACTCTACCATGG-3 , (SEQ ID N°5). Ces deux oligonuclfiotides ont 6t6 hybrid6s puis 
ligaturfis au fragment Hind lll- Hind lll A Hincll-A^ll. g6n6rant ainsi la totalit6 du g&ne de 
la SAH comportant I'insertion totale du peptide 11 entre S(419) et T(420), imm§- 
diatement pr6c6d6e de la r6gion d'exportation'prSpro" de la SAH. Ce fragment est 
30 clon§ dans I'orientation productive et dans le site HindHI du plasmide pYG1 05. 
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2. Insertion par substitution et addition dans une region ayant naturellement des 
sites de restriction uniques 

Dans un autre mode de realisation, les oligonucleotides synth6tis6s ont 6t6 les 
5 suivants : 5 -CCATGGTAGAGTAGGTATCTATTTCGGTATGAAA-3 , 

(SEQ ID N°6) et 5 , -CTAGTTTCATACCGAAATAGATACCTACTTCTTACCATGG-3 , 
(SEQ ID N°7). Ces deux oligonucleotides sont hybrid6s puis ligatures au fragment 
Hind lll- Hind lll A Hinc ll-Avrll. g6nerant ainsi le gdne d'une chim&re ampute des r6sidus 

T(420) & N(429) et substitu6s par la sequence du peptide 11. Ce fragment est clone 
10 dans I'orientation productive et dans le site idindW 3u piasmide pYG105. 

* 

3. Insertion stricte dans une region ne possedant pas de sites de restriction 
uniques 

15 Dans un autre mode de realisation, deux sites de restriction uniques, Sst I et 

Xho 1, ont 6t6 cr6es par mutag6r.ese dirig6e. Le mSme type de clonage directionnel 
a 6t6 r6alis6. Dans le cas de I'inserticn totals du peptide 1 1 entre les r6sidus A(191) 
et S (192), les deux oligonucleotides suivants ont 6t6 synth6tis6s :5'- 
ACGGGATGAAGGGAAGGCCCATGGTAGAGTAGGTATCTATTTCGGTATGAAA-3' 

20 (SEQ ID N°8) et 5'- 

TCGATTTCATTACCGAAATAGATACCTACTCTACCATGGGCCTTCCCTTCATCCCG 
TAGCT-3' (SEQ ID N°9). On precede ensuite, selon le protocole d6j& d6crit aux 
points 1 ou 2 precedents, pour la realisation du piasmide d'expression correspondant. 

25 4, Insertion par substitution dans une region ne possedant pas de sites de 

restriction uniques 

+ 

(.'insertion necessite au pr6alable la creation des deux sites de restriction Sst I 
et Xho 1 . Pour 1'insertion partielle en soi, les deux oligonucleotides suivants ont 6t6 
30 synth6tis6s : ff-ACATQGTAQ AGTAGGTATCTATTTCGGTATGAAA-ff (SEQ ID N°1 0) 
et ff-TCGATTTCATACCGAAATAGATACCTACTCTACCATGTAGCT-S' (SEQ ID 
N°11). Dans ce dernier cas, les r6sidus R(186) & A(191) sont deletes et substitu6s 
par le peptide 11. Les deux plasmides d'expression de ces chimeres sont 
respectivement 1671 et 1667. 
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Chacun des plasmides, obtenus aux points 1, 2, 3 et 4, est utilisfe pour 
transformer une souche de levure selon le protocole d6crit en exemple 4. les 
prot§ines correspondantes sont secr6t6s et purifi6es seon les exemples 5 et 6. 

EXEMPLE 8 

INSERTION DE LA SEQUENCE PEPTIDIQUE IEGR, SUBSTRAT DU FACTEUR Xa 

On proc&de k Insertion du peptide contenant la s6quence IEGR (SEQ ID 
N°21), cible du facteur prot6asique Xa, qui coupe la prothrombine en thrombine dans 
la cascade de factions intervenant dans la coagulation sanguine. 

1. Insertion stride dans la region 5 ayant un site Pvull unique 

Pour effectuer cette insertion au site Pvu II du g&ne de I'albumine, il est dans 
un premier temps cr6er un vecteur r6plicatif d6pourvu du site Pvu II, dans lequel est 
ensuite ins6r§ le gdne codant pour la pr6pro albumine. Les deux oligonuc!6otides 
compl6mentaires suivants, codant pour la s&quence IEGR, ont 6galement 6t6 
synth§tis6s: 5'-GATCCATAGAAGGTCGACTAG-3'(SEQ ID N°12) et 3'- 
CTAGGTATCTTCCAGCTGATC-5'(SEQ ID N°13). Ces deux oligonucteotides sont 
ensuite hybrid6s puis ins6r§s au niveau du site Pvu II du g6ne de falbumine. Les 
modifications que leur insertion fait apparaitre sur les s6quences nucteotidiques et 
peptidiques de I'albumine sont report6es sur la figure 4. La strat6gie de clonage est 

sch6matis6e en figure 5. 

Cette construction illustre le cas oO des sequences de jonction sont introduces 

de part et d'autre du peptide d'int6r6t. 
2.lnsertion dans la r&gion 13. 

La figure 6 dScrit une strat6gie de clonage d'un peptide dans la r6gion 13. 

Dans ce qui suit, les sites de restriction Sst I et Xho 1 ont 6te cr66s par 
mutag6n6se dirig6e. Dans le cas de la r6gion 1 3 on a proc6d6 respectivement & une 
substitution totale et une substitution addition de la s6quence IEGR simple ou 
encadr6e de sequences de jonction. 
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En ce qui concerns la substitution totale, les oligonucleotides utilises sont les 
suivants: 

5*- CAGAATCGAAGGTAGAGCC-3 1 (SEQ ID N°14) et 
5'- TCGAGGCTCTACCTTCGATCGAGGGTAGCT- 3 l (SEQ ID N°1 5) 
5 Au niveau protSique, la sequence (187)DEGK (SEQ ID N°22) est substitute 

par IEGR. 

Dans le second cas, les oligonucleotides utilises sont les suivants: 
5'- ACCCTCGATCGAAGGTAGATCTCCA- 3'(SEQ ID N°1 6) 
10 5'- TCGATGGAGATCTACCTTCGATCGAGGGTAGCT-3 , (SEQ ID N°1 7) 

Au niveau prot6ique, I'albumine est amputee du r6sidu R(186) au r6sidu 
A(191) et remp!ac6e par la sequence PSIEGRSP (SEQ ID N°23) conduisant done a 

une addition de deux rfisidus. 

Les prolines correspondantes sont secr6t6es et purifi6es suivant les 
15 protocoles d§crits dans les exemples precedents, le tableau I ci-aprds rend compte 
des caract6ristiques de ces chimeres. 



Albumine 
incorporant: 


Taux d'expression 
ua/l 


Rendement 
purification 


Purete finale 


IEGR 


150 


15 


98 


PSIEGRPS 


110 


36 


96 



TABLEAU I 



20 

3 Activite biolooioue de la chimera obtenue selon t'exemple 8.1 

* 

La chimdre est incub6e en presence de facteur Xa bovin dans un rapport 
25 enzyme/substrat de 1/10 et dans un tampon Tris 50mM, NaCI 100mM t CaCl2 1mM, 
pH 8.0 pendant 3 heures & 37°C. A Tissue de ce traitement, on realise une analyse 
SDS PAGE qui met en evidence le clivage de la chimdre caract6ris6 par la generation 
d'un fragment cTalbumine de masse de Tordre de 60kD. 



30 
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LISTE DE SEQUENCES 



(1) INFORMATION GENERALE: 

( i ) DEPOSANT : 
5 (A) NOM: RHONE- POUL EN C RORER S.A. 

(B) RUE: 20, avenue Raymond ARON 

(C) VTLLE: ANTONY 

(E) PAYS: FRANCE 

(F) CODE POSTAL: 92165 

10 (ii) TITRE DE L 1 INVENTION :Nouveaux polypeptides biologiquement actifs, 

leur preparation et composition pharmaceutique les contenant. 

(iii) NOMBRE DE SEQUENCES: 24 

(iv) FORME LISIBLE PAR ORDINATEUR: 

(A) TYPE DE SUPPORT: Tape 

15 (B) ORDINATEUR: IBM PC compatible 

(C) SYSTEME D" EXPLOITATION: PC-DOS/MS-DOS 

(D) LOGICIEL: Patentln Release #1.0, Version #1.25 (OEB) 

(2) INFORMATION POUR LA SEQ ID NO: 1: 

(i) CARACTERI STIQUES DE LA SEQUENCE: 
20 (A) LONGUEUR: 13 paires de base 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS : simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

25 (iii) HYPOTHETIQUE: NON 

(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 1: 



GGCCNNNNNG GCC 



13 



30 



(3) INFORMATION POUR LA SEQ ID NO: 2: 

(i) CARACTERI STIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 8 paires de base 

(B) TYPE: acide nucleique 

35 (C ) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 

40 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 2: 



GCGGCCGC 



6 
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(4} INFORMATION POUR LA SEQ ID NO: 3 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
(A) LONGUEUR: 30 paires de bases 
c (B) TYPE: acide nucleique 

(C) N OMB RE DE BRINS; simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 

10 SfSS SflA SEQUENCE: SEQ ID NO: 3: 

GAAATGCATA AGCTCTTGCC ATTCTCACCG 



15 (5) INFORMATION POUR LA SEQ ID NO: 



4: 



20 



(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 64 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS : simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 
Mill ANTI-SENS: NON 

xt) DESCRIPTION DE LA SEQUENCE: SEQ ID 



25 



GAAAACTCCA ACTCTTGTAG AGGTCTCGAG 60 

64 



AAAT 



in ffi\ INFORMATION POUR LA SEQ ID NO: 5 

30 (6) I ^OR^ cTERISTIQUES D£ ^ SEQUENCE: 

(A) LONGUEUR: 68 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 
35 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 

Mill ANTI-SENS: NON 0 
\A) dScRIPTION DE LA SEQUENCE: SEQ ID N 5 

40 CTAGATTTCT CGAGACCTCT ACAAGATGTG GAGTTTTCAT ACCGAAATAG 
TACCATGG 



68 



45 
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(7) INFORMATION POUR LA SEQ ID NO: 6: 

(i) CARACTERI STIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 34 paires de bases 

(B) TYPE: acide nucleique 

5 (C) NOMBRE DE BRINS: simple 

<D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 
10 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N°6 



CCATGGTAGA GTAGGTATCT ATTTCGGTAT GAAA 



15 (8) INFORMATION POUR LA SEQ ID NO: 7: 

(i) CARACTERI STIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 40 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 
20 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N°7 



25 



CTAGTTTCAT ACCGAAATAG ATACCTACTT CTTACCATGG 



(9) INFORMATION POUR LA SEQ ID NO: 8: 
30 (i) CARACTERI STIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 52 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
35 (ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N°8 



40 ACGGGATGAA GGGAAGGCCC ATGGTAGAGT AGGTATCTAT TTCGGTATGA AA 



(10) INFORMATION POUR LA SEQ ID NO: 9: 

(i) CARACTERI STIQUES DE LA SEQUENCE: 
45 (A) LONGUEUR: 61 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

50 (iii) HYPOTHETIQUE: NON 

(iii) ANTI-SENS: NON 
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30 



45 



(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N°9 



TCGATTTCAT TACCGAAATA GATACCTACT CTACCATGGG CCTTCCCTTC ATCCCGTAGC 

61 



(11) INFORMATION POUR LA SEQ ID NO: 10: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
(A} LONGUEUR: 34 paires de bases 
10 (B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS : simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 
15 (iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N 10 



ACATGGTAGA GTAGGTATCT ATTTCGGTAT GAAA 34 

20 (12) INFORMATION POUR LA SEQ ID NO: 11: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 42 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 
25 (D) CONFIGURATION : lineaire 

(ii) TYPE DE MOLECULE: ADNc 
' (iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N°ll 



TCGATTTCAT ACCGAAATAG ATACCTACTC TACCATGTAG CT 42 



(13) INFORMATION POUR LA SEQ ID NO: 12: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
35 (A) LONGUEUR: 21 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

40 (iii) HYPOTHETIQUE: NON 

(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 12 



GATCCATAGA AGGTCGACTA G 

(14) INFORMATION POUR LA SEQ ID NO: 13: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 21 paires de bases 

(B) TYPE: acide nucleique 
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(C) N OMB RE DE BRINS: simple 

(D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 

5 (iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N 13 

21 

CTAGGTATCT TCCAGCTGAT C 

10 (15) INFORMATION POUR LA SEQ ID NO: 14: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 19 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 
15 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 

(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N 14 



20 

CAGAATCGAA GGTAGAGCC 

(16) INFORMATION POUR LA SEQ ID NO: 15: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
25 (A) LONGUEUR: 30 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

30 (iii) HYPOTHETIQUE: NON 

(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N 15 



19 



TCGAGGCTCT ACCTTCGATC GAGGGTAGCT 30 



35 



(17) INFORMATION POUR LA SEQ ID NO: 16: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 24 paires de bases 

(B) TYPE: acide nucleique 
40 (C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS : NON 
45 (xi ) DESCRIPTION DE LA SEQUENCE: SEQ ID N 16 



ACCCTCGATC GAAGGTAGAT CTCCA 24 

(18) INFORMATION POUR LA SEQ ID NO: 17: 
50 (i) CARACTERISTIQUES DE LA SEQUENCE: 
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(A) LONGUEUR: 33 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 

(XX) DESCRIPTION DE LA SEQUENCE: SEQ ID N 17 



10 TCGATGGAGA TCTACCTTCG ATCGAGGGTA GCT 33 

(19) INFORMATION POUR LA SEQ ID NO: 16: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
(A) LONGUEUR: 31 paires de bases 
15 (B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(iii) HYPOTHETIQUE: NON 
20 (iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N 18 



GTCCCGGATG GAGCGCGTAC TTAGAGAGAA T 31 

25 (20) INFORMATION POUR LA SEQ ID NO: 19: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 39 paires de bases 

(B) TYPE: acide nucleique 

(C) NOMBRE DE BRINS: simple 
30 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: ADNc 
(iii) HYPOTHETIQUE: NON 
(iii) ANTI-SENS: NON 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID N 19 



35 



TCGACAGGGC CTACCTCGCG CATGAATCTC TCTTAAGCT 39 



(21) INFORMATION POUR LA SEQ ID NO: 20: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
40 (A) LONGUEUR: 11 acides amines 

(B) TYPE: acide amine 
(D) CONFIGURATION: lineaire 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 20 



45 His Gly Arg Val Gly He Tyr Phe Gly Met Lys 

5 10 
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(22) INFORMATION POUR LA SFQ ID NO: 21: 

{i) CARACTERI STIQUES DS LA SEQUENCE: 

(A) LONGUEUR: 4 acides amines 

(B) TYPE: acide amine 

5 (D) CONFIGURATION: lineaire 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 21 



lie Glu Gly Arg 

10 (23) INFORMATION POUR LA SEQ ID NO: 22: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 4 acides amines 

(B) TYPE: acide aioi ne 

(D) CONFIGURATION: lineaire 

15 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 22 



Asp Glu Gly Lys 

(24) INFORMATION POUR LA SEQ ID NO: 23: 
20 (i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 8 acides amines 

(B) TYPE: acide amine 

(D) CONFIGURATION: lineaire 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 23 

25 

Pro Ser He Glu Gly Arg Ser Pro 

5 

30 (25) INFORMATION POUR LA SEQ ID NO: 24: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 9 acides amines 

(B) TYPE: acide amine 

(D) CONFIGURATION: lineaire 

35 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 24 

Arg Met Glu Arg Val Leu Arg Glu Asn 

5 
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REVENDICATIONS 

1 . Polypeptide recombinant comportant au moins une partie active derivee d'un 
polypeptide, naturel ou synthetique, biologiquement actif t genetiquement inseree 
dans une albumine ou un de ses variants ou derives. 

5 2. Polypeptide selon la revindication 1 caracterise en ce que le polypeptide 
biologiquement actif possede une activite therapeutique et est d'origine humaine. 

3. Polypeptide selon la revindication 2 caracterise en ce que le polypeptide 
ayant une activite therapeutique est choisi parmi tout ou partie des enzymes, des 
inhibiteurs d'enzymes, des antigenes, des anticorps, des hormones, des recepteurs, 

10 des facteurs de la coagulation, des interferons, des cytokines, des facteurs de 
croissance et/ou de differentiation, des facteurs impliques dans la genese/resorption 
des tissus osseux, des facteurs chimiotactiques, des facteurs de motilite ou de 
migration cellulaire, des facteurs cytostatiques, des facteurs bactericides ou 
antifongiques, ou des molecules adhesives plasmatiques, interstitielles ou des 

15 matrices extracellulaires. 

4. Polypeptide selon Tune dss revendications 1 a 3 caracterise en ce que le 
polypeptide ayant une activite therapeutique est choisi parmi toute sequence 
peptidique antagoniste ou agoniste d'interactions moleculaires et/ou cellulaires 
impliquees dans les pathologies des compartiments circulatoires et interstitiels. 

20 5. Polypeptide selon Tune des revendications 1 a 4 caracterise en ce que la 
partie active presente une structure choisie parmi : 

(a) la structure peptidique entiere ou, 

(b) un fragment de (a) ou une structure derivee de (a) par modification 
structural (mutation, substitution, addition et/ou deletion d'un ou plusieurs residus) et 

25 conservant une activite therapeutique. 



6. Polypeptide selon Tune des revendications 1 a 5 caracterise en ce que la 
partie active est inseree strictement a Tinterieur de I'aibumine ou entouree de 
sequence de jonctions. 



WO 95/30759 



PCI7FR95/00520 



32 



7. Polypeptide selon Tune des revendications 1 a 6 caracterise en ce que la 
partie active est inseree de preference au niveau des regions de I'albumine 
presumees former des regions exposees a la surface de la molecule. 

8. Polypeptide selon Tune des revendications 1 a 7 caracterise en ce que la 
5 partie active est inseree au niveau de la region 5 s'etendant du residu 57 a 62 de 

I'albumine. 

* 

9. Polypeptide selon Tune des revendications 13 7 caracterise en ce que la 
partie active est inseree au niveau de la region 8 s'etendant du residu 103 a 120 de 
I'albumine. 

10 10. Polypeptide selon Tune des revendications 1 a 7 caracterise en ce que la 
partie active est inseree au niveau de la region 13 s'etendant du residu 178 a 200 de 
I'albumine. 

11. Polypeptide selon Tune des revendications 1 a 7 caracterise en ce que la 
partie active est inseree au niveau de la region du residu 415 au residu 425 delimitee 

15 par les helices h2 et h3 du domaine III dans I'albumine. 

12. Polypeptide selon Tune des revendications 1 a 12 caracterise en ce que la 
partie active y est inseree sous forme unique ou multiple. 

13. Polypeptide selon la revendication 12 caracterise en ce que la partie active 
est repeter plusieurs fois au meme endroit et/ou dans des regions differentes de 

20 I'albumine. 

14. Polypeptide selon Tune des revendications 1 a 13 caracterise en ce que les 
parties actives inserees sont de nature differente. 

15. Variant d'une sequence nucleotidique codant pour I'albumine ou un de ses 
variants ou derives integrant au moins un site de restriction unique non naturel. 

25 16. Sequence nucleotidique codant pour un polypeptide selon Tune quelconque 
des revendications 1 a 14. 



17. Sequence nucleotidique selon la revendication 16 caracterisee en ce qu'elle 
comprend une sequence "leader" permettant la secretion du polypeptide exprime. 
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18. Cassette d'expression comprenant une sequence nucleotidique seion I'uiie 
des revendications 16 ou 17 sous le controle d'une region d'initiation de la 
transcription et eventuellement d'une region de terminaison de la transcription. 

19. Plasmide autoreplicatif comportant une cassette d'expression selon la 
5 revendication 18. 

20. Cellule recombinante eucaryote ou procaryote dans laquelle a ete insere une 
sequence nucleotidique selon Tune des revendications 16 ou 17 ou une cassette 
d'expression selon la revendication 18 ou un plasmide selon la revendication 19. 

21. Cellule recombinante selon la revendication 20 caracterisee en ce qu'il s'agit 
10 d'une levure, d'une cellule animale, d'un champignon ou d'une bacterie. 

22. Cellule recombinante selon la revendication 21 caracterisee en ce qu'il s'agit 
d'une levure. 

23. Cellule recombinante selon la revendication 22 caracterisee en ce qu'il s'agit 
d'une levure du genre Saccharomvces ou Kluweromvces . 

15 24. Procede de preparation d'un polypeptide tel que defini dans Tune des 
revendications 1 a 14 caracterise en ce que Ton cultive une cellule recombinante 
selon Tune des revendications 20 a 23 dans des conditions d'expression, et on 
re cup ere le polypeptide produit. 

25. Composition pharmaceutique comprenant un ou plusieurs polypeptides seion 
20 Tune quelconque des revendications 1 a 14. 

26. Composition pharmaceutique comprenant une sequence nucleotidique selon 
I'une quelconque des revendications 16 a 17 utilisable en therapie genique. 
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1. 

BIOLOGICALLY ACTIVE POLYPEPTIDES INSERTED INTO AN 

ALBUMIN 

The present invention relates to new biologically active polypeptides, their 
preparation and pharmaceutical compositions containing them. 

Although possessing one or several potential therapeutic activities, 
numerous polypeptides unfortunately cannot be exploited pharmaceutically. 
This may be for different reasons, in particular their weak stability in vivo, 
their complex or fragile structure, the difficulty of producing them on an 
acceptable industrial scale, etc. Likewise, certain polypeptides do not give 
the expected results in vivo, due to problems of administration, conditioning, 
pharmacokinetics, etc. 

The aim of the present invention is precisely to overcome these drawbacks. 

It is particularly aimed at the development of artificial, biologically active 
proteins permitting optimal exploitation of the biological properties of these 
polypeptides for therapeutic use. 

The applicant accordingly provides evidence that using genetic methods, it is 
possible to insert any active structure derived from a biologically active 
protein into another protein structure consisting of albumin, without altering 
the said biological properties. In an unexpected way, human serum albumin 
allows the active structure to be efficiently introduced at its sites of 
interaction, and increases the plasmatic stability of the recombinant 
polypeptide embodied in the invention. 

More precisely, the present invention relates to a recombinant polypeptide 
containing at least one active portion derived from a natural or artificial 
biologically active polypeptide inserted genetically into an albumin or one of 
its variants or derivatives. 

According to the present invention, a variant of albumin is understood to 
designate any protein with long plasma half-life obtained by modification of 
a gene coding for a given isomorph of the human serum albumin by genetic 
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engineering techniques. It is also understood to designate any 
macromolecule with long plasma half-life obtained by in vitro modification 
of the protein coded for by such genes. (Modification is understood to mean 
any mutation, substitution, deletion, addition or modification of a genetic 
and/or chemical nature.) Albumin being very polymorphic, numerous 
natural variants have been identified and listed [Weitkamp L.R. et al., Ann. 
Hum. Genet. 37(1973)219]. 

Derivatives of albumin most particularly refers to molecules consisting 
wholly or partly of albumin, linked with at least one polypeptide sequence 
resulting from a natural or artificial gene, whether or not it has biological 
activity itself. 

In the description that follows, the different types of albumins clarified 
above are commonly designated by the term "albumin." 

In the sense of the present invention, active portion is understood to mean a 
portion having an activity which can be either direct (treatment of diseases, 
diagnosis, biological research, markers) or indirect (for example, usable in 
preventing diseases, in the design of vaccines, in medical imaging 
techniques, etc). 

The active portions of biologically active polypeptides inserted according to 
the invention are preferably of therapeutic interest. 

Polypeptides having therapeutic activity may or may not be of human 
origin. 

As representative of polypeptides of non-human origin, one may cite 
peptides or their derivatives having properties potentially useful in the 
pathologies of blood or interstitial compartments, such as hirudine, 
trigramine, antistatine, the anticoagulant peptides of ticks (TAP), arietine, 
applagine, etc. 

According to a preferred embodiment of the invention, the polypeptide 
having a therapeutic activity is a polypeptide of human origin or a molecular 
variant. For example, it may consist, wholly or partly, of an enzyme, an 
enzyme inhibitor, an antigen, an antibody, a hormone, a receptor, a factor 
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involved in the control of coagulation, an interferon, a cytokine [the 
interleukins, but also the natural variant antagonists of their attachments to 
receptors, the SIS (small induced secreted)-type cytokines and, for example, 
the inflammatory proteins of macrophages (MIPs), etc], a growth and/or 
differentiation factor [and for example the transforming growth factors 
(TGFs), the differentiation factors of blood cells (erythropoietin, M-CSF, G- 
CSF, GM-CSF, etc.), insulin and the growth factors that resemble it (IGFs), 
or also the cellular permeability factors (VPF/VEGF), etc.], a factor 
implicated in the genesis or resorption of bone tissue (OIF and 
osteospontine, for example), a factor implicated in cellular motility or 
migration [for example, the autocrine mobility factor (AMF), the migration 
stimulating factor (MSF), or also the dispersion factor (scatter 
factor/hepatocyte growth factor)], a bactericidal or antifungal factor, a 
chemotactic factor [for example, the platelet factor 4 (PF4), or the monocyte 
chemoattractant peptides (MCP/MCAF) or neutrophils (NCAF), etc.], a 
cytostatic factor (for example, the proteins that attach to galactosides), an 
adhesive molecule, either plasmatic (for example, the von Willebrand factor, 
fibrinogen, etc.), or interstitial (laminine, tenascene, vitronectine, etc.), or 
extracellular matrices, or also any peptide sequence antagonistic or agonistic 
to molecular and/or intercellular interactions implicated in the pathologies of 
circulatory and interstitial compartments, for example the formation of 
arterial or venous thrombi, cancerous metastases, tumoral angiogenesis, 
inflammatory shock, autoimmune diseases, bone and osteo-articular 
pathologies, etc. 

Of course, the active portion of the polypeptides of the invention may 
consist of the entire biologically active peptide or a structure derived from it, 
or may also correspond to a non-natural peptide sequence, isolated for 
example from random peptide banks. (For purposes of simplification, these 
different possibilities will be covered in what follows by the collective 
designation "active portion of a biologically active peptide.") In the sense of 
the present invention, derived structure is understood to mean any 
polypeptide obtained by modification which retains its biological activity. 
Modification is understood to mean any mutation, substitution, deletion, 
addition or modification of genetic and/or chemical nature. Such derivatives 
may be generated for different purposes, especially 
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such as increasing the affinity of the molecule for its attachment sites, 
improving its level of production, increasing its resistance to proteases, 
enhancing its therapeutic efficacy or reducing its secondary effects, or giving 
it new biological properties. As an example, the chimeric polypeptides of the 
invention have pharmacokinetic properties and biological activity usable in 
the prevention or treatment of diseases. 

The polypeptides according to the invention that are particularly 
advantageous are those in which the active portion has: 

(a) the entire peptide structure, or 

(b) a fragment of (a) or a structure derived from (a) by structural 
modification (mutation, substitution, addition and/or deletion of 
one or several residues) and possessing therapeutic activity. 

Among the structures of type (b) may be cited in particular the molecules in 
which certain sites of N- or O-glycosylation have been modified or deleted, 
molecules in which one or several residues have been substituted, or 
molecules in which all the cysteine residues have been substituted. One may 
also cite molecules obtained from (a) by deleting the regions having little or 
no involvement in the interaction with the linkage sites under consideration, 
or expressing an undesirable activity, and molecules including 
supplementary residues with respect to (a), for example, a terminal 
methionine-N, and/or a secretion signal, and/or a junction peptide. 

The subject matter of the present invention is particularly advantageous in 
regard to active peptides that are too small to form a structural domain, 
and/or do not possess good stability in vivo and/or good bioavailability. The 
proposed method of insertion according to the invention allows them to be 
associated with one or more preexisting domains of albumin and thus to 
benefit from the bioavailability and stability in vivo of the latter. 

In general, the sizes of the active portions inserted into the albumin vary 
between three and twenty five amino acid residues. However, sequences of 1 
to 100 residues can also be used. 
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The insertion of an active portion of a peptide into the peptide sequence of 
albumin is accomplished according to the invention in a way satisfying the 
following two conditions: 

Sufficient accessibility must be preserved to keep intact the biological 
activity of the active portion inserted into the albumin. Moreover, the 
structure of the albumin of course cannot be subjected to a destabilization 
significant enough to be detrimental to the recombinant polypeptide, or 
chimera. 

The sites of insertion are preferably selected within the albumin while 
adhering to the above requirements. 

According to the crystalline structure published by He and Carter (Nature 
1992, 358 , 209-215), albumin is formed by the repetition of 3 domains, each 
comprising two subdomains, and it consists of over 67% alpha helices. Each 
domain is superimposable on the others and is composed of 10 helices 
designated hi to hlO. The subdomain A consists of helices hi to h6 and the 
subdomain B, helices hi to hlO. Each subdomain is formed in a common 
pattern: hi, h2, h3, h4 for domain A, and h7 5 h8, h9 and hlO for domain B. 
The small supplementary helices h5 and h6 are linked by a disulfide bond to 
subdomain A. Figure 1 schematizes the structure of human serum albumin. 

The sites of insertion are preferably located in the regions of the albumin 
presumably forming the exposed regions on the surface of the molecule; 
these regions preferentially being loops. 

As insertion sites that are particularly suitable for the invention, three 
regions of the first domain may be mentioned: 

- region 5, which extends from residue 57 to 62 and corresponds to a loop 
joining the helices h3 and h4; 

- region 8, comprising the residues 103 to 120 and corresponding to the 
inter- subdomain zone; 

- region 13, between residues 178 and 200, which corresponds to a helix. 

According to the invention, helices h2 and h3 of domain III delimit another 
conceivable insertion region, from residues 419 to 430. 



I 
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An active portion of the biologically active peptide may be inserted in the 
albumin peptide sequence according to three different embodiments of the 
invention: 
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- A strict insertion consisting of a simple addition of the peptide sequence 
of interest into the original sequence of the albumin, which is totally 
conserved. 

- The insertion may correspond to a substitution of a portion of the 
albumin peptide sequence by the peptide sequence corresponding to the 
active portion of the peptide of interest. 

- Finally, an insertion combining an addition of a part of the active peptide 
sequence and a substitution of a portion of the albumin peptide sequence 
by the rest of the active portion of the active peptide. 

Figure 2 schematically depicts these different modes of insertion. 

Of course, the active portion of a biologically active peptide can be repeated 
several times in the chimera at the same place and/or in different regions of 
the albumin. Likewise, it is equally possible as another embodiment of the 
invention to insert different active portions, whether stemming from the 
same peptide or from different peptides. 

Finally, in another embodiment of the invention, an active portion of a 
peptide may be inserted either within the albumin or by surrounding the 
sequences of the junction. 

In regard to the junction sequences, peptide sequences rich in glycine 
residues and/or serine residues and/or threonine residues, and/or any amino 
acid residue described as frequently encountered in the zones of flexibility in 
proteins, may be especially involved. 

The recombinant polypeptides according to the invention prove to be 
particularly advantageous. 

They permit a given biological activity to be maintained in an organism for 
an extended period. It thus proves to be possible to reduce the administered 
doses, and in certain cases to potentiate the therapeutic effect, for example 
by reducing the consecutive secondary effects of a larger dose. 
Advantageously, they allow the production and utilization of structures 
derived from biologically active polypeptides that are very small and 
therefore very specific to a desired effect. Moreover, these recombinant 
polypeptides are distributed in the organism especially advantageously, 



* 



WO 95/30759 



PCT/FR95/00520 



modifying their pharmacokinetic properties and favoring the development of 
their biological activity and their utilization. They also have the advantage of 
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being weakly or nonimmimogenic for the organism in which they are used. 
Finally, the polypeptides of the invention may be expressed (or preferably 
secreted) by recombinant organisms at levels permitting their industrial use. 

Another aim of the invention relates to a process for preparing the chimeric 
molecules described above. More precisely, this process consists of causing 
a nucleotide sequence coding for an active portion of the desired polypeptide 
to be expressed by a eukaryote or prokaryote cellular host, then harvesting 
the product polypeptide. 

Among the eukaryote hosts usable within the scope of the present invention 
one may cite animal cells, yeasts or fungi. In particular, for yeasts one may 
cite those of the type Saccharomyces, Kluweromvces, Pichia, 
Schwanniomyces or Hansenula . For animal cells, COS, CHO, CI 27, etc. 
may be cited. Among the fungi able to be used according to the present 
invention one may cite in particular Aspergillus ssp, or Trichoderma ssp. As 
prokaryote hosts, bacteria such as Escherichia coli or those belonging to the 
type Corvnebacterium , Bacillus or Streptomyces would be preferable. 

The nucleotide sequences usable within the scope of the present invention 
can be prepared in different ways. Generally they are obtained by 
assembling in the reading frame the sequences coding for each of the 
functional parts of the polypeptide. These can be isolated by techniques of a 
person skilled in the art, for example, directly from cellular ARN 
messengers (ARNm), or by recloning from a bank of complementary ADN 
(ADNc), or again, totally synthetic nucleotide sequences may be involved. 
Moreover, it is understood that nucleotide sequences may also be 
subsequently modified, for example, by genetic engineering techniques, to 
obtain derivatives or variants of the said sequences. 

The insertion of this nucleotide sequence coding for the active portion of the 
polypeptide may or may not be directly achieved, depending on the region 
chosen for the insertion site in the gene coding for albumin. 

The region selected may not actually include the restriction site adequate for 
the said insertion. According to this hypothesis, it may prove to be of interest 
to introduce one or several single restriction sites prior to the insertion. 
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The creation of restriction sites at the level of the region chosen for the 
insertion site is preferably done by directed mutagenesis according to 
traditional techniques. However, one can also insert the nucleotide sequence 
corresponding to "the active portion of the biologically active peptide" 
directly by directed mutagenesis so that the insertion does not cause 
particular restriction sites to appear. 

In the particular case of region 5 in the gene coding for albumin, the 
presence of restriction site PvuII permits the direct insertion of the 
nucleotide sequence. It thus shows itself to be particularly useful as a 
cloning site for an active portion of a polypeptide which one hopes to insert 
into the translational [reading] frame within the albumin sequence at the 57 
residue level. 



In this mode of insertion, which makes use of a single restriction site already 
existing in the original albumin sequence, the ligation of the sequence 
coding for the active peptide with the restriction fragment corresponding to 
the totality of the gene coding for albumin, generates a nucleotide sequence 
comprising a hybrid gene coding for a strict insertion HAS-type chimeric 
protein. 

In the particular case of zones 419 to 430, the presence of 2 insertion sites 
(Hind i and ArvII) allows the peptides of biological interest to be inserted 
and/or substituted in the albumin sequence. 

With particular reference to regions 8 and 13, the insertion of the said 
sequence is favored if one or more restriction sites that can be manipulated 
are created for it beforehand. To be sure, the restriction sites to be created on 
the sequence level are chosen by taking into account the nature of the 
preexisting restriction sites. They must lead to a selective insertion. 

In the second embodiment, the use of two single restriction sites allows the 
construction of genes coding for chimeras having the active peptide inserted 
and/or substituted. The peptide insertion is made by replacement of a narrow 
fragment of two single restriction sites in the complementary ADN of the 
albumin, inserted by directed mutagenesis. These two single restriction sites 
can be Mst I and Kpn I in region 8 and Sst I and Xho I in region 13. The 
creation of these restriction sites may or may not modify the polypeptide 
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coding phase in this albumin gene may be exclusively the peptide 
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coding sequence, or may correspond to a blend of the coding sequence of the 
peptide and the coding sequence of the deleted albumin fragment. 

The present invention also aims to protect the variants of the nucleotide 
sequences coding for the corresponding albumin; that is, integrating at least 
one single non natural restriction site, not present in the original sequence. 

Creation of these restriction sites can additionally serve to advantage in 
inserting one or several active portions of biologically active polypeptides 
into the mature protein. 

More preferably, in the process that is the subject matter of the invention, the 
nucleotide sequence is part of an expression cassette comprising a 
transcription initiation region (promoter region), permitting expression of the 
nucleotide sequence in the host cells placed under its control and coding for 
the polypeptides in the invention. This region can arise from promoter 
regions of genes strongly expressed in the cellular host utilized, the 
expression being constitutive or regulated. With respect to yeasts, it may be 
the promoter of the gene for phosphoglycerate kinase (PKG); for 
glyceraldehyde-3 -phosphate dehydrogenase (GPD); lactase (LAC4); the 
enolases (ENO); the alcohol dehydrogenases (AOH), etc. As to bacteria, it 
may be the promoter of the right or left genes of the lambda bacteriophage 
(P L , P R ), or also the promoters of genes of the operons tryptophane (Ptrp) or 
lactose (Piac)- Moreover, this control region can be modified, for example by 
mutagenesis in vitro, by introduction of additional control elements or 
synthetic sequences, or by deletions or substitutions of the original control 
elements. The expression cassette can equally comprise a region of 
functional transcription termination in the host considered, positioned 
immediately ahead of the nucleotide sequence coding for a polypeptide 
embodied in the invention. 

In a preferred embodiment, the polypeptides of the invention result from the 
expression of a nucleotide sequence in a eukaryote or prokaryote host and 
the secretion of the expression product of the said sequence into the culture 
medium. In fact, it is particularly advantageous to be able to obtain the 
molecules by recombinant pathway directly in the culture medium. In this 
case, the nucleotide sequence coding for a polypeptide embodied in 
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the invention is preceded by a "leader" sequence (or signal sequence) 
directing the polypeptide arising in the secretion pathways of the host 
utilized. This "leader" sequence may be the natural signal sequence of the 
biologically active peptide in the case where this is a naturally secreted 
protein, or may be that of the stabilizing structure, but it may equally well 
consist of any other functional "leader" sequence, or of an artificial "leader" 
sequence. The choice of one or another of these sequences is guided in 
particular by the host utilized. Examples of functional signal sequences 
include those of the sexual pheromone genes or the "killer" toxins of yeasts. 

Besides the expression cassette, one or more markers permitting the 
selection of the recombinant host can be added, such as for example the gene 
URA 3 of the yeast S.Cerevisiae , or the genes conferring resistance to 
antibiotics such as geneticine (G418), or any other toxic compound such as 
certain metallic ions. 

The ensemble comprising the expression cassette and the selection marker 
may be introduced directly into the host cells in question, that is, inserted 
beforehand in a functionally self-replicating vector. In the first case, 
sequences homologous to the regions present in the genome of the host cells 
are preferably added to this ensemble; the said sequences being then 
positioned on each side of the expression cassette and of the selection gene 
as a way to increase the integration frequency of the ensemble in the host 
genome by targeting the integration of the sequences by homologous 
recombination. In the case where the expression cassette is inserted into a 
replicative system, a replication system preferred for yeasts of the type 
Kluweromvces is derived from the plasmid pKD 1 , initially isolated from K. 
drosophilianum . A replication system preferred for the yeasts of type 

Saccharomyces is derived from plasmid 2(i of S.Cerevisiae . Furthermore, 
this expression plasmid may contain all or part of the said replication 
systems, or may combine elements derived from plasmid pkDl as well as 
from plasmid 2(i. 

In addition, the expression plasmids may be shuttle vectors between a 
bacterial host such as Escherichia coli and the chosen cellular host. In this 
case, a replication origin and a selection marker functioning in the bacterial 
host are required. It is also possible to position the restriction sites of 
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around the bacterial sequences and only on the expression vector; this makes 
it possible to delete [these sequences] by cutting and religation in vitro of the 
truncated vector before transformation of the host cells, which may result in 
an increased number of copies and an enhanced stability of the expression 
plasmids in the said hosts. For example, these restriction sites may 
correspond to sequences such as 5'-GGCCNNNNNGGCC-3' (Sfil) (SEQ ID 
N°l) or 5'-GCGGCCGC-3' (Notl) (SEQ ID N°2), inasmuch as these sites 
are extremely rare and generally absent from an expression vector. 

After construction of such vectors or expression cassettes, they are 
introduced into the host cells retained according to the traditional techniques 
described in the literature. In this regard, any method permitting the 
introduction of a foreign ADN into a cell may be used. This includes in 
particular transformation, electroporation, conjugation or any other 
technique known to a person skilled in the art. As an example for the hosts 
of yeast type, the different strains of Kluweromyces utilized have been 
transformed by treating the intact cells in the presence of lithium acetate and 
polyethylene glycol, according to the technique described by Ito et al. [J. 
Bacteriol. 153 (1983) 183]. The transformation technique described by 
Durrens et al. [Curr. Genet. IS (1990) 7] utilizing ethylene glycol and 
dimethylsulfoxide has also been used. It is also possible to transform yeasts 
by electroporation, according to the method described by Karube et al. 
[FEBS Letters 182 (1985) 90]. An alternative protocol is also described in 
detail in the examples that follow. 

After the selection of transformed cells, the cells expressing the said 
polypeptides are inoculated and the said polypeptides may be recovered, 
either during cellular growth for the "ongoing" processes, or at the end of 
growth for batch cultures. The polypeptides that are the subject matter of the 
present invention are purified using the supernatant of the culture, with the 
aim of determining their molecular, pharmacokinetic and biological nature. 

A preferred system of expression of the polypeptides according to the 
invention consists of using the yeasts of the Kluweromyces type as cellular 
hosts, transformed by certain vectors derived from the extrachromosomic 
replicon pKDl initially isolated from K.marxianus var. drosophilianum . 
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replicating the said vectors in a stable manner and furthermore have the 
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advantage of being included in the G.R.A.S. (Generally Recognized As 
Safe) list of organisms. Preferred yeasts are preferably the industrial strains 
of the Kluweromvces type capable of replicating in stable fashion the said 
plasmids derived from plasmid pKDl and in which a selection marker and 
expression cassette have been inserted permitting the secretion of a high 
level of the polypeptides according to the invention. 

The present invention also relates to the nucleotide sequences coding for the 
chimeric polypeptides described earlier, as well as the recombinant cells, 
eukaryotes or prokaryotes, comprising such sequences. 

The present invention also relates to the application of the polypeptides 
according to the present invention as medicines. More particularly, the aim 
of the invention is any pharmaceutical composition comprising one or 
several polypeptides or nucleotide sequences such as are described earlier. 
These pharmaceutical compositions can be presented in various 
formulations. They may particularly involve nanoparticles having surfaces 
on which the polypeptides according to the invention are present. This type 
of formulation is utilized more particularly to achieve directed targeting of 
the active [molecule]. Of course, the nucleotide sequences can be used in 
gene therapy. 

The present invention will be described more completely with the aid of the 
examples that follow, which should be considered as illustrative and not 
limiting. 



LIST OF FIGURES 

The representations of plasmids indicated in the following figures are not 
drawn to scale and only the restriction sites that are important for 
understanding the clonings carried out are indicated. 

Figure I : Schematic representation of the domain I of albumin with location 
of the insertion sites according to the invention. * signals the location of the 
h2«h3 insertion site in domain III. The numbers 5, 8 and 13 identify the 
corresponding insertion zones. 
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Figure 2: Schematic representation of different modes of insertion of an 
active peptide in the structure of albumin. 

Figure 3: PlasmidpYGlOS. 

Figure 4: Modification of region 5 of albumin following the insertion of the 
sequence coding for IEGR, as described in example 8.1 The modifications 
are represented in bold type. The placement of a modified or introduced 
restriction site is indicated by a horizontal line, and the position of an 
enzyme cutting with a vertical line. 

Figure 5: Cloning strategy for insertion of IEGR in region 5 (example 8.1). 

Figure 6: Cloning strategy for insertion of an active peptide in region 13 of 
albumin. 

EXAMPLES 

GENERAL CLONING TECHNIQUES 

The traditional methods used in molecular biology, such as the preparative 
extractions of plasma ADN, centrifugation of plasma ADN in a cesium 
chloride gradient, electrophoresis on agar gels or acrylamide, purification of 
ADN fragments by electroelution, extraction of proteins in phenol or phenol- 
chloroform, precipitation of ADN in saline medium by ethanol or 
isopropanol, transformation in Escherichia coli , etc., are well known to a 
person skilled in the art and are abundantly described in the literature 
[Maniatis T. et al., "Molecular Cloning, a Laboratory Manual," Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y., 1982; Ausubel F.M. et al. 
(eds), "Current Protocols in Molecular Biology," John Wiley & Sons, New 
York, 1987.] 

Restriction enzymes were supplied by New England Biolabs (Biolabs), 
Bethesda Research Laboratories (BRL) or Amersham, and are used 
according to the recommendations of the suppliers. 

Plasmids of type pBR322, pUC and the Ml 3 series phages are commercial 
in origin (Bethesda Research Laboratories). 
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For the ligations, the ADN fragments are separated according to size by 
electrophoresis on agar gel or acrylamide, extracted with phenol or a 
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phenol/chloroform mixture, precipitated with ethanol and incubated in the 
presence of the ADN ligase of phage T4 (Biolabs) according to the 
supplier's recommendations. 

The filling of prominent 5 5 extremities is achieved by the Klenow fragment 
of E. coli ADN polymerase (Biolabs), used according to the supplier's 
specifications. The prominent 3 5 extremities are destroyed in the presence of 
the ADN polymerase of T4 phage (Biolabs) according to the manufacturer's 
recommendations. The prominent 5' extremities are destroyed by a 
treatment brought about with SI nuclease. 

Directed mutagenesis in vivo with synthetic oligodeoxynucleotides is 
achieved according to the method developed by Taylor et al. [Nucleic Acids 
Res. 13 (1985) 8749-8784], using the kit distributed by Amersham. In the 
context of the present invention, this technique is utilized in particular to 
create single restriction sites for use in a subsequent insertion. 

Enzymatic amplification of ADN fragments by the technique called PCR 
[Polymerase-catalyzed Chain Reaction, Saiki R.K. et al., Science 230 (1985) 
1350-1354; Mullis K.B. and Faloona F.A., Math. Enzym. 155 (1987) 335- 
350] is carried out with the use of a "DNA thermal cycler" (Perkin Elmer 
Cetus) according to the manufacturer's specifications. 

The nucleotide sequences are verified using the method developed by 
Sanger et al. [Proc. Natl. Acad. Sci. USA, 74 (1977) 5463-5487], using the 
kit distributed by Amersham. 

K.lactis is transformed using the ADN from the expression plasmids of the 
proteins of the present invention by any technique known to a person skilled 
in the art. An example is given in the text. 

Except where otherwise indicated, the bacterial strains used are E.coli 
MC1060 QaclPOZYA, X74, galU, galK, sfrA 1 ), or E.coli TGI (lac, proA,B, 
sup E, thi, hsdD5/ F'traD36, proA*B + , lacl Q , lacZ, Ml 5). 

The yeast strains used are those of budding yeast, in particular yeasts of the 
Kluweromvces type. The K. lactis MW98-8C strain (a. uraA, aro, lys, K + , 
pKDl e ) and K. lactis CBS 293.91 were used in particular. A sample of the 
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MW98-8C strain was deposited on September 16, 1988 at Centraalbureau 
voor Schimmelkulturen (CBS), Baara (Netherlands) where it was registered 
under the number CBS 579.88. 
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A bacterial strain (R_coli) transformed with the plasmid pET-8c52K was 
deposited on April 17, 1990 with the American Type Culture Collection 
under the number ATCC 68306. 

Yeast strains transformed by the expression plasmids coding for the proteins 
of the present invention are cultivated in Erlenmeyer flasks or 2 1 pilot 
fermenters (SETRIC, France), in rich medium (YPD: 1% yeast extract, 2% 
Bactopeptone, 2% glucose; or YPL: 1% yeast extract, 2% Bactopeptone, 2% 
lactose) under constant stirring at 28°C. 



EXAMPLE 1 : 

PROTOCOL FOR STRICT PEPTIDE INSERTION UTILIZING A 
SINGLE RESTRICTION SITE PRESENT IN THE HSA GENE. 



Due to its uniqueness in the HSA gene and the associated vector, the site 
PvuII, located naturally in the coding sequence, is particularly useful as a 
cloning site for a biologically active peptide that one wishes to insert in 
translation phase into the HSA at the level of the 58 residue. In a particular 
embodiment, it is useful to employ peptides from p residues for which the 
coding sequence is [3xN] p . In this case, the oligonucleotides synthesized are 
of type 5'-NN[3xN] p N-3* and its complementary strand. The ligation of this 
fragment with the restriction fragment Hindl ll- Hindl ll A PvuII, 
corresponding to the totality of the gene coding for the HSA, generates a 
Hind lll- Hind lll restriction fragment comprising a hybrid gene coding for a 
chimeric protein of strict insertion type HSA. In another embodiment, the 
peptide may be repeated several times in the chimera. 

EXAMPLE 2: 

TOTAL OR PARTIAL PEPTIDE INSERTION PROTOCOL UTILIZING 
TWO SINGLE AND NATURAL RESTRICTION SITES. 



In a particular embodiment, the use of two single restriction sites allows for 
the creation of genes coding for chimeras having the active peptide in strict 
insertion or partial insertion. 



« 
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For example, the existence of single Hind i and Avr ll sites in the coding 
sequence of HS A and in the vector allows an Hind ffl- Hind lll A HincII- Avrll 
fragment to be generated. The elimination of the nucleotide fragment Hind i- 
Avr ll corresponds to the deletion of the peptide fragment T(420) - N(429). 
The use of an appropriate complementary oligonucleotide permits a peptide 
in cloning phase to be cloned in the HSA gene. This restriction fragment 
may be exclusively the complementary coding sequence of the peptide, or 
may correspond to a blend of the coding sequence of the active peptide and 
the coding sequence of the T(420) - N(429) fragment of the HSA. The 
active peptide may be present several times in the chimera. 

EXAMPLE 3 

EXPRESSION PLASMIDS 



The chimeric proteins of the preceding examples may be expressed in the 
yeast, starting from functional promoters, regulated or constitutive, such as, 
for example, those present in the plasmids pYG105 (promoter LAC 4 of 
Kluweromvces lactis), pYG106 (promoter PGK of Saccharomvces 
cerevisiae), pYG536 (promoter PH05 of S. cerevisiae), or the hybrid 
promoters such as those described in the patent application EP 361 991. The 
plasmids pYG105 and pYG106 are particularly useful here because they 
permit the expression of genes coded by the restriction fragments Hindlll 
that are described in the preceding examples and are cloned in the Hindlll 
site and in productive orientation (defined as the orientation that places the 
"prepro" region of the albumin in proximal fashion relative to the 
transcription promoter), from the functional promoters in K. lactis , whether 
regulatable (pYG105), or constituent (pYG106). The plasmid pYG105 
corresponds to the plasmid pKan707 described in patent application EP 361 
991, in which the restriction site Hindlll, single and located in the gene for 
resistance to geneticine (G418), has been destroyed by directed mutagenesis 
while preserving a protein unchanged (oligodeoxynucleotide 5'- 
GAAATGC ATAAGCTCTTGCC ATTCTC ACCG-3 ' (SEQ ID No. 3). The 
fragment Sall-SacI coding for the gene URA 3 of the transferred plasmid was 
replaced by a restriction fragment Sall-SacI, comprising an expression 
cassette consisting of the promoter LAC4 of K. lactis (in the form of a 
fragment Sall- Hindl lD and the terminator of the PGK gene of S. cerevisiae 
(in the form of a fragment Hindlll-SacI). The plasmid pYG105 is very stable 
mitotically in the Kluweromvces yeasts. It 
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is represented in Figure 3. The plasmids pYG105 and pYG106 do not differ 
from each other than in the nature of the transcription promoter encoded by 
the fragment Sall- Hind llL 



WO 95/30759 

18. 

EXAMPLE 4: 

TRANSFORMATION OF YEASTS 
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The transformation of yeasts belonging to the Kluyveromyces type, and 
particularly the strains MW98-8C and CBS 293.91 of K. lactis , is carried 
out, for example, by the technique of treating the intact cells with lithium 
acetatye [Ito H. et aL, J. Bacteriol. 153 (1983) 163-168], adapted as follows. 
The growth of the cells takes place at 28°C in 50 ml of YPD medium 
through stirring and up to an optical density between 0.6 and 0.8 at 600 nm 
(OD600); the cells are harvested by centrifugation at low speed, washed in a 
sterile solution of TE (10 mM Tris HC1 pH 7.4; 1 mM EDTA), resuspended 
in 3-4 ml of lithium acetate (0.1 M in the TE) to obtain a cellular density of 
about 2 x 10 8 cells/ml, then incubated at 30°C for 1 hour with moderate 
stirring. Aliquots of 0.1 ml of the resulting suspension of competent cells are 
incubated at 30°C for 1 hour in the presence of ADN and at a final 
concentration of 35% polyethylene glycol (PEG^oo, Sigma). After a thermal 
shock of 5 minutes at 42°C, the cells are washed twice, resuspended in 0.2 
ml of sterile water and incubated for 16 hours at 28°C in 2 ml of YPD 
medium to permit the phenotypic expression of the gene conferring 
resistance to G418, expressed under control of the promoter P kl (cf. EP 361 

991); 200 (il of the cellular suspension is then spread over YPD selective 

containers (G418, 200|u,g/ml). The containers are put into incubation at 28°C 
and the transformed cells appear after 2 to 3 hours of cellular growth. 

EXAMPLE 5 : 

SECRETION OF CHIMERAS 

After selection in rich medium supplemented with G418, the recombinant 
clones are tested for their capacity to secrete the mature form of chimeric 
proteins. Clones corresponding to the CBS 293.91 or MW98-8C strain, 
transformed by the expression plasmids of the chimeras between the HSA 
and the biologically active portion are incubated in YPD or YPL medium at 
28°C. Cells in the supernatant are retrieved by centrifugation when the cells 
reach stationary growth phase, concentrated tenfold if necessary by 
precipitation for 30 minutes at 20°C in a final concentration of 60% ethanol, 
and tested after electrophoresis on SDS-PAGE 0 gel, either directly by 
coloration of the gel by coomassie blue, or after 
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immunoblot, using primary antibodies directed against the biologically 
active portion or a polyclonal rabbit serum directed against the HSA. During 
the immunologic detection experiments, the nitrocellulose filter is first 
incubated in the presence of the specific primary antibody, washed several 
times, incubated in the presence of goat antibodies directed against the 
primary antibody, then incubated in the presence of an avidine-peroxidase 
complex utilizing the "ABC kit" distributed by Vectastain (Biosys S.A., 
Compiegne, France). The immunologic reaction is then revealed by addition 
of diamino-3,3' benzidine tetrachlorhydrate (Prolabo) in the presence of 
oxygenated water, according to the manufacturer's recommendations. 



EXAMPLE 6: 

PURIFICATION OF CHIMERAS 



The chimeras present in the supernatants of the cultures corresponding to the 
transformed strain CBS 293.91 are characterized at a first stage with the aid 
of antibodies specific to the HSA part. It may be desirable to purify some of 
these chimeras. The culture is then centrifuged (10,000 g, 30 min), the 
supernatant is passed through a 0.22 mm filter (Millipore), then concentrated 
by ultrafiltration (Amicon) using a membrane for which the discrimination 
threshold is at 30 kDa. The concentrate obtained is then dialyzed using a 
solution of Tris HC1 (50 mM, pH 8) and purified by column. For example, 
the concentrate corresponding to the supernatant of the culture from the 
transformed strain CBS 293.91 is purified by affinity chromatography on 
Trisacryl Blue (IBF), and the samples are next dialyzed using water. 
Purification by molecular sieve may then be carried out. In this case, a 
Superose 12 column (Pharmacia) is equilibrated beforehand with a buffer of 
20 mM NaH 2 P0 4 and 100 mM NaCl at pH 7.0, and the samples resulting 
from the affinity chromatography are loaded on the column, harvested and 
characterized. Purification by ion exchange chromatography may also be 
used. In certain cases, the concentrate obtained after ultrafiltration is 
dialyzed using a solution of Tris HC1 (50 mM, pH 8), then deposited in 20 
ml fractions on a cation exchange column (5 ml) (S Fast Flow, Pharmacia) 
equilibrated with the same buffer. The column is then washed several times 
with the Tris HC1 solution (50 mM, pH 8), and the chimera protein is then 
eluted in the column by an NaCl gradient (0 to 1 M). 
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The fractions containing the chimera protein are then combined and dialyzed 
using a solution of Tris HC1 50mM (pH 8) and redeposited on the S Fast 
Flow column. After elution of the column, the fractions containing the 
protein are combined, dialyzed using water and lyophilized before 
characterization. 



EXAMPLE 7: 

INSERTION OF PEPTIDE 1 1 IN THE ALBUMIN SEQUENCE 

Peptide 1 1 was described as an epitote of tryptophan synthase (Larvor et al. 3 
Mol. Immunol. (1991) 28, 523-531). 

1 . Strict insertion in a region naturally having single restriction sites 

Peptide 1 1 has the following peptide sequence in the terminal N a C sense: 
HGRVGIYFGMK (SEQ ID No.20). The strategy of insertion in the Hinc II- 
Avr ll zone has consisted of creating a ligation between a synthesized 
nucleotide fragment coding for peptide 1 1 such that the reading frame for 
the sequence coding for HSA is respected, and the nature of the sequence 
coding for HSA is as it originally was. In a first stage we have thus 
synthesized the following two oligonucleotides: 5'- 
CCATGGTAGAGTAGGTATCTATTTCGGTATGAAAACTCCAACT 
CTTGTAGAGGTCTCGAGAAAT-3 5 (SEQ ID No.4) and 5'- 
CTAGATTTCTCGAGACCTCTACAAGATGTGGAGTTTTCATACCGA 
AATAGATACCTACTCTACC ATGG-3 ' (SEQ ID No.5). 
These two oligonucleotides have been hybridized and then ligated from the 

fragment Hindlll-Hindlll A Hinc II- Avr IL thus generating the totality of 
the HSA gene comprising the total insertion of peptide 11 between S(419) 
and T(420), immediately preceding the "prepro" exportation region of the 
HSA. This fragment is cloned in productive orientation and in the Hindlll 
site of the pYG105 plasmid. 
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2. Insertion by substitution and addition in a region naturally having single 
restriction sites 

In another embodiment, the oligonucleotides synthesized were the 
following: 5 '-CCATGGTAGAGTAGGTATCTATTTCGGTATGAAA-3 5 
(SEQ ID No.6), and 

5 CTAGTTTCATACCGAAATAGATACCTACTTCTTACCATGG-3 5 
(SEQ ID No. 7). These two oligonucleotides are hybridized and then ligated 

to the fragment Hindl ll- Hind lH A HincII-Avrll, thus generating the gene 
of a chimera severed from the residue T(420) to N(429) and substituted by 
the sequence of peptide 11. This fragment is cloned in productive orientation 
and in the Hindlll site of the pYG105 plasmid. 

3. Strict insertion in a region not having single restriction sites 

In another embodiment, two single restriction sites, Sst I and Xho 1, have 
been created by directed mutagenesis. The same type of directional cloning 
was achieved. In the case of total insertion of peptide 1 1 between residue 
A(191) and S(192), the following two oligonucleotides were synthesized: 
5 ' - ACGGG ATG AAGGG AAGGCCC ATGGT AG AGT AGGT ATCT ATTT 
CGGTATGAAA-3 ' (SEQ ID No.8), and 

5'-TCGATTTCATTACCGAAATAGATACCTACTCTACCATGGGCCTT 
CCCTTC ATCCCGTAGCT-3 5 (SEQ ID No.9). 

One then proceeds according to the protocols already described in the 
preceding points 1 or 2 to obtain the corresponding expression plasmid. 

4. Insertion by substitution in a region not having single restriction sites 

The insertion requires the preliminary creation of two restriction sites, SstI 
and Xhol. For the partial insertion itself, the following two oligonucleotides 
were synthesized: 

5 '-ACATGGTAGAGTAGGTATCTATTTCGGTATGAAA-3 ' 
(SEQ ID No. 10), and 

5 '-TCGATTTCATACCGAAATAGATACCTACTCTACCATGTAGCT-3 ' 
(SEQ ID No. 11). 

In this last case, the residues R(186) to A(191) are deleted and substituted by 
the peptide 11. The two expression plasmids of these chimeras are 1671 and 
1667, respectively. 



WO 95/30759 PCT/FR95/00520 

Each of the plasmids obtained in points 1 , 2, 3 and 4 is used to transform a 
yeast strain, according to the protocol described in example 4. The 
corresponding proteins are secreted and purified as in examples 5 and 6. 

EXAMPLE 8: 

INSERTION OF PEPTIDE SEQUENCE IEGR 3 THE SUBSTRATE OF 
FACTOR Xa 

The insertion of the peptide containing the sequence IEGR (SEQ ID No.21) 
comes next, target of the protease factor Xa, which turns prothrombin into 
thrombin in the cascade of reactions involved in the coagulation of blood. 

1 . Strict insertion in region 5 having a single PvuII site. 

To achieve this insertion at the PvuII site of the albumin gene, a first step is 
to create a replicative vector lacking a PvuII site, in which the gene coding 
for the prepro albumin is then inserted. The following two complementary 
oligonucleotides, coding for the sequence IEGR, have also been synthesized: 
5 GATCCATAGAAGGTCGACTAG-3 ' (SEQ ID No. 1 2), and 
3 CTAGGTATCTTCCAGCTGATC-5 ' (SEQ ID No. 1 3). 
These two oligonucleotides are then hybridized and inserted at the level of 
site PvuII in the albumin gene. The modifications in the nucleotide and 
peptide sequences of the albumin that their insertion causes to appear are 
presented in Figure 4. The cloning strategy is schematized in Figure 5. 
This construction illustrates the case where the junction sequences are 
introduced on either side of the peptide at issue. 

2. Insertion in region 13. 

Figure 6 depicts a cloning strategy for a peptide in region 13. 
In what follows, the restriction sites Sst I and Xho 1 were created by directed 
mutagenesis. In the case of region 13, a total substitution and an addition 
substitution of the IEGR sequence were carried out respectively, either 
simple or surrounded by the junction sequences. 
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In regard to the total substitution, the oligonucleotides utilized are the 
following: 

5 ' -C AGAATCG AAGGT AG AGCC-3 5 (SEQ ID No. 14), and 
5 '-TCGAGGCTCTACCTTCGATCGAGGGTAGCT-3 > (SEQ ID NO. 1 5). 
On the protein level, the sequence (187)DEGK (SEQ ID No.22) is 
substituted by IEGR. 

In the second case, the oligonucleotides utilized are the following: 
5 ' -ACCCTCGATCG AAGGT AG ATCTCCA-3 5 (SEQ IDNo.16) 
5 ' -TCG ATGG AG ATCT ACCTTCG ATCG AGGGT AGCT-3 5 
(SEQ ID No. 17) 

On the protein level, the albumin is severed from residue R(186) to residue 
A(191) and replaced by the sequence PSIEGRSP (SEQ ID No.23), thus 
leading to an addition of two residues. 

The corresponding proteins are secreted and purified according to the 
protocols described in the preceding examples. Table 1 below lists the 
characteristics of these chimeras. 

Table 1 

Albumin Expression Rate Purification Yield Final Purity 
Incorporating |ug/l 



IEGR 150 15 98 

PSIEGRPS 110 36 96 



3. Biological activity of the chimera obtained according to example 8.1 

The chimera is incubated for 3 hours at 37°C in the presence of bovine factor 
Xa in an enzyme/ substrate ratio of 1/10 and a buffer of Tris 50mM, NaCl 
lOOmM, and CaCl 2 ImM, at pH 8.0. At the conclusion of this treatment, an 
SDS PAGE analysis is performed that provides evidence for cleavage of the 
chimera, characterized by the generation of an albumin fragment with a 
mass of the order of 80 kD. 
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LIST OF SEQUENCES 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: RHONE-POULENC RORER S. A. 

(B) STREET: 20, avenue Raymond ARON 

(C) CITY: ANTONY 

(D) COUNTRY: FRANCE 

(E) ZIP CODE; 92165 

(ii) TITLE OF THE INVENTION: New biologically active polypeptides, 
their preparation and pharmaceutical compositions containing them. 

(iii) NUMBER OF SEQUENCES: 24 

(iv) LEGIBLE FORM FOR COMPUTER: 

(A) TYPE OF SUPPORT: TAPE 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 (OEB) 

(2) INFORMATION FOR SEQ ID NO.l: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF SEQUENCE: SEQ ID NO. 1 
GGCCNNNNNG GCC 

(3) INFORMATION FOR SEQ ID NO.2: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRTPTION OF THE SEQUENCE: SEQ ID NO.2 



GCGGCCGC 
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(4) INFORMATION FOR SEQ ID N0.3: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ED N0.3 
GAAATGCATA AGCTCTTGCC ATTCTCACCG 

(5) INFORMATION FOR SEQ ID NO.4: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO.4 

CCATGGTAGA GTAGGTATCT ATTTCGGTAT GAAAACTCCA ACTCTTGTAG 
AGGTCTCGAG AAAT 

(6) INFORMATION FOR SEQ ID NO. 5: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS simple: 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO.5 

CTAGATTTCT CGAGACCTCT ACAAGATGTG GAGTTTTCAT ACCGAAATAG 
ATACCTACTC TACCATGG 



WO 95/30759 PCT/FR95/00520 

26. 

(7) INFORMATION FOR SEQ ID N0.6: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO.6 
CCATGGTAGA GTAGGTATCT ATTTCGGTAT GAAA 

(8) INFORMATION FOR SEQ ID NO. 7: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO.7 
CTAGTTTCAT ACCGAAATAG ATACCTACTT CTTACCATGG 

(9) INFORMATION FOR SEQ ID NO. 8: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO.8 

ACGGGATGAA GGGAAGGCCC ATGGTAGAGT AGGTATCTAT TTCGGTATGA 
AA 

(10) INFORMATION FOR THE SEQ ID NO.9: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 
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(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 
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(xi)DESCRlPTION OF THE SEQUENCE: SEQ ID NO.9 

TCGATTTCAT TACCGAAATA GATACCTACT CTACCATGGG CCTTCCCTTC 
ATCCCGTAGC T 



(11) INFORMATION FOR SEQ ID NO. 10: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 10 
ACATGGTAGA GTAGGTATCT ATTTCGGTAT GAAA 

(12) INFORMATION FOR SEQ ID NO.ll: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
(QNUMBER OF STRANDS: simple 
(D)CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRTPTION OF THE SEQUENCE: SEQ ID NO.ll 
TCGATTTCAT ACCGAAATAG ATACCTACTC TACCATGTAG CT 

(13) INFORMATION FOR SEQ ED NO.12: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(QNUMBER OF STRANDS: simple 
(D)CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO.12 
GATCC ATAGA AGGTCGACTA G 

(14) INFORMATION FOR SEQ ID NO. 13: 

(i)CHARACTERISTICS OF THE SEQUENCE: 



WO 95/30759 



PCT/FR95/00520 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 



WO 95/30759 PCT/FR95/00520 

28 

(C) NUMBER OF STRANDS: simple 
(E) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 13 
CTAGGTATCT TCCAGCTGAT C 

(15) INFORMATION FOR SEQ ID NO. 14: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 14 
CAGAATCGAA GGTAGAGCC 

(16) INFORMATION FOR SEQ ID NO. 15: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 15 

TCGAGGCTCT ACCTTCGATC GAGGGTAGCT 

17) INFORMATION FOR SEQ ID NO. 16: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(i ii) ANTI- SENSE : NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 16 
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(18) INFORMATION FOR SEQ ID NO. 1 7: 

(i) CHARACTERISTICS OF THE SEQUENCE: 
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(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 17 
TCGATGGAGA TCTACCTTCG ATCGAGGGTA GCT 

1 9) INFORMATION FOR SEQ ID NO. 1 8 : 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCPJPTION OF THE SEQUENCE: SEQ ID NO. 18 
GTCCCGGATG GAGCGCGTAC TTAGAGAGAA T 

20) INFORMATION FOR SEQ ID NO. 1 9 : 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) NUMBER OF STRANDS: simple 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: ADNc 

(iii) HYPOTHETICAL: NO 
(iii)ANTI-SENSE: NO 

(xi)DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 19 
TCGACAGGGC CTACCTCGCG CATGAATCTC TCTTAAGCT 

21) INFORMATION FOR SEQ ID NO. 20: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 11 amino acids 

(B) TYPE : amino acid 

(C) CONFIGURATION: linear 

(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 20 



His Gly Arg Val Gly He Tyr Phe Gly Met Lys 
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22) INFORMATION FOR SEQ ID NO. 2 1 : 

(i) CHARACTERISITCS OF THE SEQUENCE: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) CONFIGURATION: linear 

(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 21 
He Glu Gly Arg 

23) INFORMATION FOR SEQ ID NO. 22: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) CONFIGURATION: linear 

(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 22 

Asp Glu Gly Lys 

24) INFORMATION FOR SEQ ID NO . 23 : 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) CONFIGURATION: linear 

(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 23 
Pro Ser He Glu Gly Arg Ser Pro 

25) INFORMATION FOR SEQ ID NO. 24: 

(i) CHARACTERISTICS OF THE SEQUENCE: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) CONFIGURATION: linear 

(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO. 24 



Arg Met Glu Arg Val Leu Arg Glu Asn 
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CLAIMS 

1 . Recombinant polypeptide comprising at least one active portion derived from a 
polypeptide, natural or synthetic, biologically active, genetically inserted in an albumin or one of 
its variants or derivatives. 

2. Polypeptide according to claim 1, characterized in that the biologically active peptide has 
therapeutic activity and is of human origin. 

3. Polypeptide according to claim 2, characterized in that the polypeptide having a 
therapeutic activity is chosen from among all or part of the enzymes, enzyme inhibitors, 
antigenes, antibodies, hormones, receptors, coagulation factors, interferons, cytokines, growth 
and/or differentiation factors, factors involved in the genesis/resorption of bone tissue, 
chemotactic factors, motility or cellular migration factors, cytostatic factors, bactericidal or 
antifungal factors, or plasmatic, interstitial or extracellular matrix adhesive molecules. 

4. Polypeptide according to one of claims 1 through 3, characterized in that the polypeptide 
having a therapeutic activity is chosen from among any antagonist or agonist peptide sequence 
having molecular and/or cellular interaction involved in the pathologies of circulatory and 
interstitial compartments. 

5. Polypeptide according to one of claims 1 through 4, characterized in that the active 
portion has a structure chosen from among: 

a) the entire peptide structure or, 

b) a fragment of (a) or a structure derived from (a) by structural modification (mutation, 
substitution, addition and/or deletion of one or more residues) and preserving therapeutic 
activity. 

6. Polypeptide according to one of claims 1 through 5, characterized in that the active 
portion is inserted strictly into the interior of the albumin or surrounded by junction sequences. 
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7. Polypeptide according to one of claims 1 through 6, characterized in that the active 
portion is inserted preferably at the level of those regions on the albumin that presumedly form 
regions exposed to the surface of the molecule. 

8. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of region 5 extending from residues 57 to 62 of the albumin. 

9. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of region 8 extending from residues 103 to 120 of the albumin. 

10. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of region 13 extending from residues 178 to 200 of the albumin. 

11. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of the region from residue 415 to residue 425 delimited by the 
helices h2 and h3 of domain EE in the albumin. 

12. Polypeptide according to one of claims 1 through 12, characterized in that the active 
portion is inserted in a single or multiple manner. 

13. Polypeptide according to claim 12 characterized in that the active portion is repeated 
several times in the same place and/or in different regions of the albumin. 

14. Polypeptide according to one of claims 1 through 13, characterized in that the active 
portions inserted are different in nature. 

15. Variant of a nucleotide sequence coding for the albumin or one of its variants or 
derivatives including at least one single non-natural restriction site. 

16. Nucleotide sequence coding for a polypeptide according to any of claims 1 to 14. 

17. Nucleotide sequence according to claim 16 characterized in that it comprises a "leader" 
sequence allowing the secretion of the expressed polypeptide. 



33 

18. Expression cassette comprising a nucleotide sequence according to claim 16 or 17 under 
the control of a transcription initiation region and possibly a transcription termination region. 

19. Self-replicating plasmid comprising an expression cassette according to claim 18. 

20. Eukaryote or prokaryote recombinant cell in which a nucleotide sequence according to 
claim 16 or 17 or an expression cassette according to claim 1 8 or a plasmid according to claim 
19 has been inserted. 

21. Recombinant cell according to claim 20 characterized in that it is a yeast, an animal cell, 
a fungus or a bacterium. 

22. Recombinant cell according to claim 21 characterized in that it is a yeast. 

23. Recombinant cell according to claim 22 characterized in that it is a yeast of the type 
Saccharomvces or Kluweromvces . 

24. Process for preparing a polypeptide as defined in one of claims 1 to 14 characterized in 
that a recombinant cell is cultivated according to one of claims 20 to 23 under expression 
conditions, and the polypeptide produced is recovered. 

25. Pharmaceutical composition comprising one or more peptides according to any of claims 
1 to 14. 

26. Pharmaceutical composition comprising a nucleotide sequence according to either claim 
1 6 or 1 7 that may be used in gene therapy. 
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CLAIMS 

1 . Recombinant polypeptide comprising at least one active portion derived from a 
polypeptide, natural or synthetic, biologically active, genetically inserted in an albumin or one of 
its variants or derivatives. 

2. Polypeptide according to claim 1, characterized in that the biologically active peptide has 
therapeutic activity and is of human origin. 

3. Polypeptide according to claim 2, characterized in that the polypeptide having a 
therapeutic activity is chosen from among all or part of the enzymes, enzyme inhibitors, 
antigenes, antibodies, hormones, receptors, coagulation factors, interferons, cytokines, growth 
and/or differentiation factors, factors involved in the genesis/resorption of bone tissue, 
chemotactic factors, motility or cellular migration factors, cytostatic factors, bactericidal or 
antifungal factors, or plasmatic, interstitial or extracellular matrix adhesive molecules. 

4. Polypeptide according to one of claims 1 through 3, characterized in that the polypeptide 
having a therapeutic activity is chosen from among any antagonist or agonist peptide sequence 
having molecular and/or cellular interaction involved in the pathologies of circulatory and 
interstitial compartments. 

5. Polypeptide according to one of claims 1 through 4, characterized in that the active 
portion has a structure chosen from among: 

a) the entire peptide structure or, 

b) a fragment of (a) or a structure derived from (a) by structural modification (mutation, 
substitution, addition and/or deletion of one or more residues) and preserving therapeutic 
activity. 

6. Polypeptide according to one of claims 1 through 5, characterized in that the active 
portion is inserted strictly into the interior of the albumin or surrounded by junction sequences. 
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7. Polypeptide according to one of claims 1 through 6, characterized in that the active 
portion is inserted preferably at the level of those regions on the albumin that presumedly form 
regions exposed to the surface of the molecule. 

8. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of region 5 extending from residues 57 to 62 of the albumin. 

9. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of region 8 extending from residues 103 to 120 of the albumin. 

10. Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of region 13 extending from residues 178 to 200 of the albumin. 

1 1 . Polypeptide according to one of claims 1 through 7, characterized in that the active 
portion is inserted at the level of the region from residue 415 to residue 425 delimited by the 
helices h2 and h3 of domain HI in the albumin. 

12. Polypeptide according to one of claims 1 through 12, characterized in that the active 
portion is inserted in a single or multiple manner. 

13. Polypeptide according to claim 12 characterized in that the active portion is repeated 
several times in the same place and/or in different regions of the albumin. 

14. Polypeptide according to one of claims 1 through 13, characterized in that the active 
portions inserted are different in nature. 

15. Variant of a nucleotide sequence coding for the albumin or one of its variants or 
derivatives including at least one single non-natural restriction site. 

16. Nucleotide sequence coding for a polypeptide according to any of claims 1 to 14. 

17. Nucleotide sequence according to claim 16 characterized in that it comprises a "leader" 
sequence allowing the secretion of the expressed polypeptide. 



33 

18. Expression cassette comprising a nucleotide sequence according to claim 16 or 17 under 
the control of a transcription initiation region and possibly a transcription termination region. 

19. Self-replicating plasmid comprising an expression cassette according to claim 18. 

20. Eukaryote or prokaryote recombinant cell in which a nucleotide sequence according to 
claim 16 or 17 or an expression cassette according to claim 18 or a plasmid according to claim 
19 has been inserted. 

21 . Recombinant cell according to claim 20 characterized in that it is a yeast, an animal cell, 
a fungus or a bacterium. 

22. Recombinant cell according to claim 21 characterized in that it is a yeast. 

23. Recombinant cell according to claim 22 characterized in that it is a yeast of the type 
Saccharomvces or Kluweromvces . 

24. Process for preparing a polypeptide as defined in one of claims 1 to 14 characterized in 
that a recombinant cell is cultivated according to one of claims 20 to 23 under expression 
conditions, and the polypeptide produced is recovered. 

25. Pharmaceutical composition comprising one or more peptides according to any of claims 
1 to 14. 

26. Pharmaceutical composition comprising a nucleotide sequence according to either claim 
1 6 or 1 7 that may be used in gene therapy. 
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